Repository navigation
melgan
- Website
- Wikipedia
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
MelGAN implementation with Multi-Band and Full Band supports...
Ultrafast GAN based Vocoder for Text to Speech
zero-shot realtime TTS system, fully offline, free and open source
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
MelGAN Multi GPU Implementation.
MelGAN with catalyst framework
SE-MelGAN - Speaker Agnostic Rapid Speech Enhancement