Repository navigation
speaker-adaptation
- Website
- Wikipedia
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
This repository has implementation for "Neural Voice Cloning With Few Samples"
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
PyTorch implementation of Densely Connected Time Delay Neural Network
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
Implementation of the paper "Speaker Adaptive Training for Speech Recognition Based on Attention-over-Attention Mechanism" in Pytorch.