Repository navigation

#

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python
2108
9 个月前

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Python
325
3 年前
Python
229
2 年前

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Python
146
3 年前

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python
120
3 年前

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Python
48
2 年前

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

Python
29
2 年前

A neural speech codec based on discrete WavLM representations

Python
23
8 个月前

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

HTML
10
4 年前

This is the experimental description of MnTTS2.

Jupyter Notebook
10
1 年前

In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system 😄 In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...

Python
10
1 年前

DelightfulTTS with Hifi-GAN and Univnet vocoders

Jupyter Notebook
8
10 个月前

Python package for NSF and NSF-HiFi-GAN (unofficial)

Python
6
6 天前