A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

text-to-speech unsupervised non-autoregressive multi-speaker tts PyTorch fastspeech transformer neural-tts fastspeech2 hifi-gan sota speech-synthesis deep-learning

Python

325

3 years ago

NTT123 / vietTTS

Vietnamese Text to Speech library

deep-learning tacotron vocoder hifi-gan vietnam vietnamese text-to-speech

Python

229

100

2 years ago

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

deep-learning fastspeech2 hifi-gan jets multi-speaker neural-tts non-autoregressive PyTorch sota speech-synthesis text-to-speech tts unsupervised end-to-end

Python

146

3 years ago

rishikksh20 / Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

hifi-gan speech-synthesis text-to-speech tts vocoder PyTorch Generative Adversarial Network

Python

120

3 years ago

nipponjo / tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

arabic hifi-gan PyTorch tacotron2 text-to-speech tts Python deep-learning speech speech-synthesis hifigan multi-speaker-tts tts-model voice-synthesis

Jupyter Notebook

110

5 months ago

Voice-Privacy-Challenge / Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

anonymization speaker-recognition asr privacy-protection privacy-monitoring de-identification speech-recognition voice-conversion speech-synthesis speech-processing hifi-gan kaldi

Python

6 months ago

keonlee9420 / Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

text-to-speech tts tacotron tacotron2 PyTorch speech-synthesis autoregressive multi-speaker robustness efficiency neural-tts hifi-gan deep-learning

Python

2 years ago

hwRG / End-to-End-TTS-Fine-Tune

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

end-to-end fastspeech2 hifi-gan tts

Python

2 years ago

lucadellalib / discrete-wavlm-codec

A neural speech codec based on discrete WavLM representations

clustering codec hifi-gan PyTorch quantization self-supervised-learning speech-synthesis wavlm

Python

8 months ago

jik876 / hifi-gan-demo

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

speech-synthesis tts hifi-gan text-to-speech deep-learning Generative Adversarial Network

HTML

4 years ago

ssmlkl / MnTTS2

This is the experimental description of MnTTS2.

tts fastspeech2 hifi-gan multi-speaker-tts

Jupyter Notebook

1 year ago

NTT123 / hifigan-tpu

Train HiFi-GAN on TPU

hifi-gan vocoder tts text-to-speech Generative Adversarial Network jax

Python

3 years ago

manhph2211 / ViTTS

In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system 😄 In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...

hifi-gan text-to-speech mfa deepspeech speech-synthesis vocoder

Python

1 year ago