speaker-recognition

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation speaker-recognition asr tts generative-ai multimodal deep-learning neural-networks speaker-diariazation speech-translation speech-synthesis large-language-models

Python

15805

3121

3 hours ago

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model deep-learning

Python

10512

1559

9 days ago

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorch speech-processing speaker-diarization voice-activity-detection pretrained-models speaker-recognition speaker-verification

Jupyter Notebook

8422

949

15 hours ago

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

speaker-diarization uis-rnn speaker-recognition supervised-learning clustering supervised-clustering machine-learning

Python

1585

321

1 year ago

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Python

1198

270

4 years ago

clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

speaker-recognition metric-learning speaker-verification

Python

1133

286

2 years ago

yeyupiaoling / VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.

PyTorch voice-recognition arcface speaker-recognition

Python

1123

160

4 months ago

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

production-ready PyTorch resnet speaker-recognition speaker-verification speaker-diarization repvgg TLS (Transport Layer Security) dino wavlm

Python

1042

156

18 days ago

athena-team / athena

An open-source implementation of a sequence-to-sequence based speech processing engine

speech-recognition asr transformer Tensorflow ctc unsupervised-learning sequence-to-sequence deployment speaker-recognition tts speech-synthesis

C++

961

201

3 years ago

astorfi / 3D-convolutional-speaker-recognition

🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

convolutional-neural-networks deep-learning speaker-recognition 3D

Python

790

269

6 years ago

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

speaker-recognition speaker-verification

Python

736

126

1 year ago

FluidInference / FluidAudio

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

coreml iOS macOS speaker-diarization speaker-identification speaker-recognition Swift audio real-time vad voice-activity-detection asr automatic-speech-recognition speech-to-text ane Nvidia

Swift

719

3 hours ago

cvqluu / Angular-Penalty-Softmax-Losses-Pytorch

Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)

metric-learning PyTorch loss-functions embedding face-verification fashion-mnist face-recognition speaker-recognition sphereface arcface

Python

493

2 years ago

taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

uis-rnn speaker-diarization speaker-recognition

Python

491

120

4 years ago

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition source-separation speaker-diarization speaker-verification speaker-identification

Python

431

2 months ago

nuaazs / VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

antifraud microservices speaker-diarization speaker-recognition speech-recognition

Python

395

1 year ago

speechbrain / speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speechrecognition neural-network neural-networks timit speech-analysis

HTML

371

4 months ago