voice-cloning

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python
54011
8 months ago

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python
44329
4 days ago
Python
13162
8 hours ago

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team

Python
12339
2 days ago

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Python
11796
3 days ago

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

Python
9526
12 days ago

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
4821
12 days ago
abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
3605
3 days ago
Jupyter Notebook
2721
9 months ago
IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python
2278
15 hours ago

A Python/PyTorch app for easily synthesising human voices

Python
1437
5 months ago

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python
836
2 years ago

The code for the bark-voicecloning model. Training and inference.

Python
695
2 years ago

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

Python
637
1 year ago

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

Python
603
3 years ago

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Python
448
6 days ago