Repository navigation

#

speaker-diarization

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
12114
5 天前

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook
8100
3 天前

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook
4852
1 天前

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

2369
4 个月前

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python
2296
7 天前
wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1787
1 个月前
google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python
1583
1 年前
juanmc2005/diart

A python package to build AI-powered real-time audio applications

Python
1406
6 个月前

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python
1001
2 个月前

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python
536
1 年前

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

Swift
525
13 分钟前

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python
487
4 年前

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python
424
7 天前