Repository navigation

non-autoregressive

Website
Wikipedia

lucidrains / soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

人工智能 audio-generation 深度学习 non-autoregressive transformers attention-mechanism

Python

1534

5 个月前

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

seamless speech speech-recognition speech-synthesis speech-to-text speech-translation translation all-in-one machine-translation streaming-audio text-to-speech asr tts voice text-to-audio non-autoregressive speech-enhancement audio-processing speech-processing

Python

1156

3 个月前

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

深度学习机器学习 non-autoregressive probabilistic text-to-speech tts tts-api diffusion-model diffusion-models

Jupyter Notebook

1135

157

12 天前

keonlee9420 / PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

text-to-speech normalizing-flows generative-model 深度神经网络 PyTorch tts speech-synthesis neural-tts non-autoregressive portable-tts vae fastspeech hifi-gan high-quality

Python

341

4 年前

keonlee9420 / DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

text-to-speech 深度神经网络 PyTorch tts speech-synthesis generative-model ddpm diffusion neural-tts non-autoregressive Generative Adversarial Network hifi-gan diffusion-models fastspeech multi-speaker-tts

Python

340

4 年前

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

text-to-speech unsupervised non-autoregressive multi-speaker tts PyTorch fastspeech transformer neural-tts fastspeech2 hifi-gan sota speech-synthesis 深度学习

Python

326

3 年前

keonlee9420 / Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

text-to-speech tts speech-synthesis non-autoregressive

Python

307

4 年前

keonlee9420 / DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

text-to-speech diffusion ddpm PyTorch singing-voice tts speech-synthesis english diffusion-models neural-tts non-autoregressive fastspeech diffsinger

Python

242

4 年前

keonlee9420 / DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

PyTorch text-to-speech tts conversational-ai dataset non-autoregressive speech-synthesis

Python

241

4 个月前

keonlee9420 / StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

text-to-speech PyTorch tts speech-synthesis english style neural-tts non-autoregressive fastspeech meta-learning speaker speaker-adaptation

Python

195

4 年前

keonlee9420 / Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

text-to-speech 深度神经网络 PyTorch tts speech-synthesis generative-model neural-tts non-autoregressive semi-supervised-learning

Python

194

3 年前

keonlee9420 / Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

neural-tts non-autoregressive vae self-attention duration speech-synthesis PyTorch tts text-to-speech english fastspeech

Python

190

4 年前

HKUNLP / diffusion-of-thoughts

[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

diffusion-models 机器学习 mathematical-reasoning 自然语言处理 non-autoregressive PyTorch text-generation

Python

180

7 个月前

xcfcode / What-I-Have-Read

Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers

自然语言处理 summarization acl aaai naacl slides presentation gnn knowledge-distillation pretrain Generative Adversarial Network non-autoregressive generation graph-neural-networks notes presentations data-augmentation meta-learning conversation

165

3 年前

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

深度学习 fastspeech2 hifi-gan jets multi-speaker neural-tts non-autoregressive PyTorch sota speech-synthesis text-to-speech tts unsupervised end-to-end

Python

146

3 年前

HKUNLP / reparam-discrete-diffusion

Reparameterized Discrete Diffusion Models for Text Generation

Python PyTorch 自然语言处理机器学习 diffusion-models fairseq text-generation language-model non-autoregressive

Python

101

3 年前