Repository navigation

voice-cloning

Website
Wikipedia

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

深度学习 PyTorch Tensorflow tts voice-cloning Python

Python

57914

9300

12 天前

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

text-to-speech tts vits voice-clone voice-cloneai voice-cloning

Python

51314

5644

25 天前

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

fine-tuning llama 大语言模型 mistral gemma llama3 unsloth deepseek deepseek-r1 gemma3 text-to-speech tts qwen qwen3 agent openai gpt-oss voice-cloning reinforcement-learning

Python

46569

3808

1 小时前

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python text-to-speech 深度学习 speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis voice-cloning voice-synthesis voice-conversion

Python

42854

5661

1 年前

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

audio-generation gpt-4o text-to-speech tts cantonese 聊天机器人 ChatGPT chinese english fine-grained fine-tuning japanese korean multi-lingual natural-language-generation Python cosyvoice cross-lingual voice-cloning

Python

16710

1815

7 天前

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

ai-translation dubbing Localization (l10n)video-translation voice-cloning

Python

15034

1544

5 个月前

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition voice-cloning vocoder voice-recognition self-supervised-learning Whisper

Python

12266

1938

7 天前

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages!

audiobooks Docker epub Linux macOS tts Windows xtts voice-cloning gradio chinese english multilingual colab-notebook kaggle audiobook

Python

11482

868

9 天前

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

foundation-models music-generation huggingface llama audio-generation voice-cloning 大语言模型人工智能深度学习 gpt

Python

5554

639

4 个月前

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx

Python

4840

414

2 个月前

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

speech speech-synthesis text-to-speech voice-cloneai voice-cloning

Jupyter Notebook

2794

246

1 年前

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

rvc vc vits voice 人工智能 voice-cloning voice-conversion applio voice-clone PyTorch speech text-to-speech tts

Python

2628

437

3 小时前

voice-cloning-app / Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

Python tts text-to-speech PyTorch 深度学习 voice-cloning tacotron2

Python

1440

241

10 个月前

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-separation

1358

148

1 年前

gitmylo / audio-webui

A webui for different audio related Neural Networks

人工智能 audioldm bark rvc text-to-audio text-to-speech voice-cloning audiocraft music generative-music tts aio all-in-one

Python

1202

105

5 个月前

panyanyany / Twocast

AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM；真人对话AI播客生成器，多语言，多音色

podcast podcast-generator voice-cloning voice-synthesis

TypeScript

1056

3 个月前

Enemyx-net / VibeVoice-ComfyUI

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.

comfyui-nodes text-to-speech tts voice-cloning

Python

995

158

2 天前

MiniMax-AI / MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

image-generation mcp mcp-server mcp-tools text-to-speech video-generation image-to-video text-to-image text-to-video voice-cloning

Python

964

154

3 个月前