Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
-
Updated
May 30, 2026 - Python
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
The open-source ElevenLabs alternative for local voice cloning, design, create, dubbing and dictation Desktop App
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
VOICE → WORDS
Instant, controllable, local pre-trained AI models in Rust
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
A python package to build AI-powered real-time audio applications
an editor for spoken-word audio with automatic transcription
OBS plugin for local speech recognition and captioning using AI
Simple GUI for ByteDance's Piano Transcription with Pedals
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加入吧!
🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.
Local speech-to-text for macOS on-device AI, fully private, optional cloud
Add a description, image, and links to the transcription topic page so that developers can more easily learn about it.
To associate your repository with the transcription topic, visit your repo's landing page and select "manage topics."