Repository navigation

#

speaker-identification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python
589
4 年前

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

Swift
525
14 分钟前

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python
424
7 天前

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

HTML
370
2 个月前

Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

Python
211
5 年前

Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

Python
209
5 年前

Deep Learning - one shot learning for speaker recognition using Filter Banks

Jupyter Notebook
170
1 年前

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Python
148
9 个月前

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Perl
136
6 年前

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Python
125
10 个月前

打造最简单的TTS前端集合,最简单的有声小说制作工作流。基于正则规则对小说进行分句,基于RoBERTa对小说中的对话进行说话人识别,从而实现一键式生成多人有声小说。多说话人的语音合成,高质量的有声小说制作。

Python
124
5 个月前

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python
111
6 年前

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

Python
104
3 年前

Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"

Python
103
6 年前

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Forth
101
2 年前

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Python
85
6 年前

A tool for summarizing dialogues from videos or audio

Python
82
2 年前