speaker-diarization

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer PyTorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection Whisper dfsmn vad speechgpt speechllm

Python

9870

989

6 days ago

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model Deep Learning

Python

9712

1474

4 days ago

espnet / espnet

End-to-End Speech Processing Toolkit

Deep Learning end-to-end chainer PyTorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization text-to-speech

Python

9010

2250

8 days ago

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorch speech-processing speaker-diarization voice-activity-detection pretrained-models speaker-recognition speaker-verification

Jupyter Notebook

7302

868

5 days ago

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text Whisper

Jupyter Notebook

4400

403

1 month ago

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Deep Learning speech speech-recognition speech-to-text asr Machine Learning Python PyTorch attention-is-all-you-need attention-mechanism attention-model speaker-diarization speech-processing transformers Whisper

Python

2365

181

20 days ago

Purfview / whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

openai speech-to-text Whisper asr speech-recognition subtitles ctranslate2 faster-whisper whisperx uvr diarization speaker-diarization

1958

8 hours ago

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-diarization speaker-verification language-identification modelscope

Python

1930

165

1 day ago

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

speaker-diarization Awesome Lists Machine Learning speech-recognition speech-processing Deep Learning

1726

232

6 months ago

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

speaker-diarization uis-rnn speaker-recognition supervised-learning clustering supervised-clustering Machine Learning

Python

1571

320

7 months ago