#text-to-speech

One minute of voice data is enough to train a good TTS model (few-shot voice cloning).

Python
50159
18 days ago
unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python
44351
3 hours ago
babysor/MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Python
36558
9 months ago

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python
34070
4 months ago
nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python
18040
1 month ago

Translate a video from one language to another and add dubbing. Also supports speech recognition and transcription, speech synthesis, and subtitle translation.

Python
13787
1 day ago

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook
9960
2 years ago

A fast, local neural text to speech system

C++
9860
1 month ago
open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python
9302
3 months ago

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python
8873
15 days ago

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python
7909
2 years ago

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python
7624
2 years ago

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, WebSocket server/client, and 12 programming languages.

C++
7112
15 hours ago

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

Python
6649
8 months ago