stt

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

semantic-search Emacs Obsidian chat ChatGPT artificial-intelligence large-language-models productivity agent self-hosted rag whatsapp-ai offline-llm llamacpp llama3 image-generation stt assistant research

Python

31232

1830

19 days ago

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook

13328

1581

24 days ago

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

speech-recognition speech-to-text stt asr pretrained-models english german spanish stt-benchmark PyTorch colab onnx text-to-speech speech speech-synthesis tts

Jupyter Notebook

5498

343

2 years ago

jianchang512 / stt

Voice Recognition to Text Tool / A local, offline tool that converts audio and video to subtitles, with output as JSON, SRT subtitles, or plain text

speech speech-recognition speech-to-text stt

Python

3885

413

1 month ago

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

artificial-intelligence audio-to-text Go subtitles sveltekit transcription Whisper ui Web app speech-recognition speech-to-text stt Web

Svelte

2683

152

2 months ago

coqui-ai / STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

stt speech-to-text Tensorflow deep-learning automatic-speech-recognition asr voice-recognition speech-recognition

C++

2514

298

2 years ago

pannous / tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Tensorflow speech-recognition neural-network deep-learning stt speech-to-text

Python

2173

635

2 years ago

neural-maze / ava-whatsapp-agent-course

Meet Ava, the WhatsApp Agent

agent agentic-workflow agents stt tts vector-database

Python

1517

385

5 months ago

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-separation

1358

148

1 year ago

lenML / Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

chattts tts agent gpt large-language-models text-to-speech colab llama chinese english cosyvoice asr stt Whisper

Python

1346

183

19 days ago

Robitx / gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

copilot Neovim speech-to-text Whisper Vim codeium Lua voice large-language-models ollama claude gpt4o gpt-4o sonnet gemini mistral perplexity stt parrot

Lua

1272

116

2 months ago

R3gm / SoniTranslate

Synchronized Translation for Videos. Video dubbing

audio-processing diarization translation translate-audio translate-video video-dubbing asr automatic-dubbing document-translator dubbing speech-to-text stt text-to-speech tts

Python

1238

278

1 month ago

mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

asr stt tts Linux nmt offline translator machine-translation speech-recognition speech-synthesis speech-to-text text-to-speech translation

C++

1159

1 month ago

joey-zhou / xiaozhi-esp32-server-java

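A Java enterprise-grade management platform for Xiaozhi ESP32: an integrated front-end, back-end, and server solution providing device monitoring, voice customization, role switching, and conversation-history management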

ESP32 Java mcp mcp-client mcp-server spring-ai stt tts xiaozhi xiaozhi-ai xiaozhi-esp32 xiaozhi-server

Java

888

320

10 hours ago

snakers4 / open_stt

Open STT

speech-to-text russian dataset stt asr automatic-speech-recognition

Python

806

4 years ago

VRCWizard / TTS-Voice-Wizard

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

tts speech-to-text speech-recognition VRChat osc Discord free voice vtuber chatbox Spotify stt text-to-speech

713

1 month ago