Repository navigation

neural-tts

Website
Wikipedia

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

text-to-speech normalizing-flows generative-model 深度神经网络 PyTorch tts speech-synthesis neural-tts non-autoregressive portable-tts vae fastspeech hifi-gan high-quality

Python

341

4 年前

keonlee9420 / DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

text-to-speech 深度神经网络 PyTorch tts speech-synthesis generative-model ddpm diffusion neural-tts non-autoregressive Generative Adversarial Network hifi-gan diffusion-models fastspeech multi-speaker-tts

Python

340

4 年前

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

text-to-speech unsupervised non-autoregressive multi-speaker tts PyTorch fastspeech transformer neural-tts fastspeech2 hifi-gan sota speech-synthesis 深度学习

Python

326

3 年前

KevinMIN95 / StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

official tts meta-learning text-to-speech neural-tts speech-synthesis speech

Python

251

4 年前

keonlee9420 / DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

text-to-speech diffusion ddpm PyTorch singing-voice tts speech-synthesis english diffusion-models neural-tts non-autoregressive fastspeech diffsinger

Python

242

4 年前

keonlee9420 / StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

text-to-speech PyTorch tts speech-synthesis english style neural-tts non-autoregressive fastspeech meta-learning speaker speaker-adaptation

Python

195

4 年前

keonlee9420 / Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

text-to-speech 深度神经网络 PyTorch tts speech-synthesis generative-model neural-tts non-autoregressive semi-supervised-learning

Python

194

3 年前

keonlee9420 / Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

neural-tts non-autoregressive vae self-attention duration speech-synthesis PyTorch tts text-to-speech english fastspeech

Python

190

4 年前

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

深度学习 fastspeech2 hifi-gan jets multi-speaker neural-tts non-autoregressive PyTorch sota speech-synthesis text-to-speech tts unsupervised end-to-end

Python

146

3 年前

keonlee9420 / VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

vae glow non-autoregressive tts text-to-speech duration PyTorch speech-synthesis self-attention neural-tts unsupervised-learning

Python

4 年前

mush42 / sonata

A cross-platform inference engine for neural TTS models.

C gRPC neural-tts Python speech-synthesis text-to-speech tts

Rust

10 个月前

keonlee9420 / FastPitchFormant

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

text-to-speech end-to-end neural-tts PyTorch tts speech-synthesis pitch fastspeech non-autoregressive

Python

4 年前

keonlee9420 / WaveGrad2

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

text-to-speech neural-tts audio synthesis non-autoregressive score-matching duration robust PyTorch tts speech-synthesis text-to-audio end-to-end

Python

4 年前

keonlee9420 / Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

text-to-speech style PyTorch tts speech-synthesis english speaker neural-tts non-autoregressive

Python

4 年前

keonlee9420 / Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

text-to-speech tts tacotron tacotron2 PyTorch speech-synthesis autoregressive multi-speaker robustness efficiency neural-tts hifi-gan 深度学习

Python

2 年前

Mobile-Artificial-Intelligence / babylon.cpp

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

人工智能 elevenlabs neural-tts onnx onnxruntime tts vits voice-cloning onnx-models onnx-runtime

Python

1 个月前

keonlee9420 / Deep-Learning-TTS-Template

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

text-to-speech PyTorch tts speech-synthesis 深度学习 fastspeech non-autoregressive neural-tts template

Python

4 年前

QuantiusBenignus / voluble

Let your GNOME desktop speak to you. Reads your desktop notifications or selected text out-loud with human-like voice using Piper. Uses a local LLM to summarize selected text.

gnome gnome-shell-extension neural-tts notifications text-to-speech Web Accessibility (a11y)kiss speech-synthesis tts autoencoder 深度学习 vits gnome-extension 机器学习

JavaScript

4 个月前