Repository navigation

#

speech-processing

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook
8422
11 小时前

AI powered speech denoising and enhancement

Python
1991
10 个月前

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python
1980
2 年前
wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1805
2 个月前
DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python
1644
3 个月前

Speech, Language, Audio, Music Processing with Large Language Model

Python
896
1 个月前

You can find the speech algorithms you want here

C
832
2 个月前

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python
828
4 个月前