voice-conversion
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily train a good VC model with 10 minutes or less of voice data!
SoftVC VITS Singing Voice Conversion
so-vits-svc fork with realtime support, improved interface and more features.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
A simple, high-quality voice conversion tool focused on ease of use and performance.
Zero-shot voice conversion and singing voice conversion, with real-time support
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
A locally deployable AI voice toolbox | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
The code for the bark-voicecloning model. Training and inference.
Unsupervised Speech Decomposition Via Triple Information Bottleneck
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Deep learning for audio processing