Repository navigation

#

speaker-diarization

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
12897
4 天前

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook
8422
12 小时前

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook
4999
2 个月前

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

2535
5 个月前

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python
2439
2 个月前
wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1805
2 个月前
google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python
1585
1 年前
juanmc2005/diart

A python package to build AI-powered real-time audio applications

Python
1470
8 个月前

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python
1042
18 天前

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

Swift
719
1 天前

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python
540
1 年前

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python
491
4 年前

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python
431
2 个月前