multi-speaker

PyTorch implementation of convolutional neural network-based text-to-speech synthesis models

Python
1976
1 year ago

Chinese Mandarin text-to-speech (TTS) synthesis based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets

Python
473
3 years ago

A non-autoregressive, Transformer-based text-to-speech system supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.

Python
325
3 years ago

Two-talker speech separation with LSTM/BLSTM using the permutation invariant training method.

Jupyter Notebook
309
3 years ago

VoxNovel: generate audiobooks giving each character a different voice actor.

Python
269
3 months ago

A non-autoregressive, end-to-end text-to-speech (text-to-wav) system supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.

Python
146
3 years ago

Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem

Jupyter Notebook
51
7 years ago

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Python
49
1 month ago

PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.

Python
48
2 years ago

Multi-speaker FastSpeech 2 applicable to Korean, with detailed descriptions of training and synthesis.

Python
8
3 years ago

An Algorithm for Speaker Recognition in a Multi-Speaker Environment

Python
4
5 years ago

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs using the PRUS dataset.

Shell
4
4 years ago