
#speech-processing

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook
8100
3 days ago

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python
1979
2 years ago

AI powered speech denoising and enhancement

Python
1931
9 months ago
wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1787
1 month ago
DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python
1633
2 months ago

Speech, Language, Audio, Music Processing with Large Language Model

Python
873
13 days ago

You can find the speech algorithms you want here

C
827
24 days ago

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python
800
3 months ago