Repository navigation

#

speech

babysor/MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python
36558
9 个月前
huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python
20525
18 小时前

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python
17365
2 个月前

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook
16801
1 年前

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell
15057
1 个月前

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python
10189
1 年前

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook
9960
2 年前
modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python
8264
2 天前

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

Python
6932
7 个月前

💬 Speech recognition for your site

JavaScript
6664
1 年前

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook
5442
2 年前

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook
4853
2 天前

A fast multimodal LLM for real-time voice

Python
4145
2 天前
Python
4145
4 个月前

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Python
3729
15 天前