Repository navigation

#

speaker-identification

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

Swift
719
1 天前

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python
591
4 年前

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python
431
2 个月前

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

HTML
371
4 个月前

Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

Python
211
5 年前

Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

Python
211
5 年前

Deep Learning - one shot learning for speaker recognition using Filter Banks

Jupyter Notebook
170
1 年前

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Python
153
10 个月前

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Perl
136
6 年前

打造最简单的TTS前端集合,最简单的有声小说制作工作流。基于正则规则对小说进行分句,基于RoBERTa对小说中的对话进行说话人识别,从而实现一键式生成多人有声小说。多说话人的语音合成,高质量的有声小说制作。

Python
133
1 个月前

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Python
126
1 年前

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python
113
6 年前

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Forth
107
3 年前

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

Python
104
3 年前

Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"

Python
103
7 年前

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Python
86
6 年前

A tool for summarizing dialogues from videos or audio

Python
83
2 年前