Repository navigation

#

stt

khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Python
30757
2 天前

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook
5442
2 年前

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Python
3729
16 天前

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte
2588
5 天前

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

C++
2499
1 年前

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Python
2171
2 年前
lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python
1329
12 天前
Robitx/gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

Lua
1250
9 天前

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

C++
1061
5 天前

小智ESP32的Java企业级管理平台,提供设备监控、音色定制、角色切换和对话记录管理的前后端及服务端一体化解决方案

Java
774
5 天前

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

C#
703
1 个月前

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

TypeScript
650
3 个月前

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

JavaScript
639
1 年前

On-device streaming speech-to-text engine powered by deep learning

Python
634
7 天前

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

C#
602
4 个月前