voice-clone

As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).

Python
50145
18 days ago

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python
34068
4 months ago

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python
7909
2 years ago

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python
4851
7 days ago
IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python
2527
1 day ago
wladradchenko/wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

Python
1060
3 months ago

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Python
489
4 months ago

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

Python
454
9 months ago

Unofficial implementation of Megatts2

Python
286
1 year ago

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice-cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM)

Python
231
1 month ago

Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting-edge Tool Converts Text or Any Audio into Your Desired Voice – Your Voice, Your Way

Python
57
5 months ago

Automated voice dubbing for YouTube videos using Docker, OpenVoice, and FastAPI. Translates and dubs videos with original voice timbre.

Python
55
2 years ago

Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise TTS, and OpenCV 🎵

Python
48
1 year ago

[Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"

Jupyter Notebook
37
14 days ago

Text-to-speech with a learnable audio encoder, without alignment to a reference transcript

Python
31
1 day ago

Speaker embedding for VI-SVC and VI-SVS, and also for VITS; use this to replace the ID to implement voice cloning.

Python
30
3 years ago

Tingshu 听舒 | Bringing the author’s voice directly to you

Python
29
9 months ago

Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Python
17
1 year ago