Repository navigation

#

speech

babysor/MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python
36146
5 个月前
huggingface/datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python
19993
3 天前

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook
16147
7 个月前

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python
15039
7 天前

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell
14779
3 个月前

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python
10135
9 个月前

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook
9786
1 年前
modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python
7723
5 天前

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

Python
6922
3 个月前

💬 Speech recognition for your site

JavaScript
6660
8 个月前

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook
5236
2 年前

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook
4399
1 个月前

A fast multimodal LLM for real-time voice

Python
3846
2 个月前