Repository navigation
voice-conversion
- Website
- Wikipedia
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily train a good VC model with voice data <= 10 mins!
SoftVC VITS Singing Voice Conversion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
so-vits-svc fork with realtime support, improved interface and more features.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
zero-shot voice conversion & singing voice conversion, with real-time support
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
A simple, high-quality voice conversion tool focused on ease of use and performance.
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
This is now the official location of the Merlin project.
Easily select, start, and manage your preferred AI digital assistants
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
一个简易的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
The code for the bark-voicecloning model. Training and inference.
Deep learning for audio processing
Unsupervised Speech Decomposition Via Triple Information Bottleneck