stt
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Voice Recognition to Text Tool / An offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Meet Ava, the WhatsApp Agent
Synchronized Translation for Videos. Video dubbing
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
On-device streaming speech-to-text engine powered by deep learning
🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Running speech to text model (whisper.cpp) in Unity3d on your local machine.