Deepgram voice keyboard: a Linux virtual keyboard powered by Deepgram Flux
A voice-controlled Linux virtual keyboard using Deepgram's Flux turn-taking STT API, built in Rust.
Tag
7 posts
A voice-controlled Linux virtual keyboard using Deepgram's Flux turn-taking STT API, built in Rust.
A GUI tool for collecting audio training data for ASR fine-tuning, with LLM-generated prompts and Hugging Face integration.
An MCP server for audio transcription using multimodal LLMs like Gemini, GPT-4o Audio, and Voxtral — not traditional ASR.
An MCP server that brings Gemini-powered audio transcription directly into Claude Code and Claude Desktop.
A desktop transcription app that sends audio directly to multimodal AI models for single-pass transcription and formatting.
A local voice typing app for Linux/Wayland using NVIDIA's Parakeet model. No cloud, no GPU, built-in punctuation.
A snapshot comparing Hebrew TTS quality across six providers, including voice cloning experiments via Replicate.