Repository navigation

#

speech-processing

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook
7302
4 天前

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python
1976
1 年前

AI powered speech denoising and enhancement

Python
1750
5 个月前
wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1726
6 个月前
DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python
1583
5 个月前

You can find the speech algorithms you want here

C
796
4 个月前

Speech, Language, Audio, Music Processing with Large Language Model

Python
783
7 天前

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB
757
4 年前