Repository navigation

#

speaker-diarization

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
9870
6 天前

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook
7302
5 天前

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook
4400
1 个月前

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1958
8 小时前

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python
1930
1 天前
wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1726
6 个月前
google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python
1571
7 个月前
juanmc2005/diart

A python package to build AI-powered real-time audio applications

Python
1253
2 个月前

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python
879
2 个月前

turnkey self-hosted offline transcription and diarization service with llm summary

Python
836
7 个月前

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python
529
7 个月前

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python
482
4 年前

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python
411
20 天前

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

Python
404
1 年前