Repository navigation

#

speaker-identification

Website
Wikipedia

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook

13328

1581

24 天前

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Python

1198

270

4 年前

FluidInference / FluidAudio

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

coreml iOS macOS speaker-diarization speaker-identification speaker-recognition Swift audio real-time vad voice-activity-detection asr automatic-speech-recognition speech-to-text ane Nvidia

Swift

719

86

1 天前

HarryVolek / PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

PyTorch speaker-identification speaker-verification

Python

591

164

4 年前

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition source-separation speaker-diarization speaker-verification speaker-identification

Python

431

39

2 个月前

speechbrain / speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

深度学习 speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speechrecognition 神经网络 neural-networks timit speech-analysis

HTML

371

31

4 个月前

Atul-Anand-Jha / Speaker-Identification-Python

Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

Python speaker-recognition speaker-identification

Python

211

77

5 年前

jymsuper / SpeakerRecognition_tutorial

Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

speaker-recognition 深度学习 speaker-verification speaker-identification PyTorch

Python

211

46

5 年前

oscarknagg / voicemap

Identifying people from small audio fragments

机器学习 speaker-identification speaker-recognition convolutional-neural-networks

Python

170

72

5 年前

Speaker-Identification / You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

triplet-loss speaker-recognition 神经网络 audio speech speaker-identification 深度学习

Jupyter Notebook

170

41

1 年前

kaistmm / Audio-Mamba-AuM

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

audio audio-classification 深度学习 mamba PyTorch representation-learning speaker-identification

Python

153

18

10 个月前

jefflai108 / pytorch-kaldi-neural-speaker-embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

speaker-verification speaker-recognition speech-processing speaker-identification PyTorch kaldi

Perl

136

34

6 年前

Warma10032 / easytts

打造最简单的TTS前端集合，最简单的有声小说制作工作流。基于正则规则对小说进行分句，基于RoBERTa对小说中的对话进行说话人识别，从而实现一键式生成多人有声小说。多说话人的语音合成，高质量的有声小说制作。

人工智能 audio-generation 自然语言处理 pyqt speaker-identification tts

Python

133

25

1 个月前

SiavashShams / ssamba

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

audio audio-classification mamba representation-learning self-supervised-learning speaker-identification 深度学习 emotion-recognition

Python

126

11

1 年前

Anwarvic / Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

speaker-recognition speaker-verification speaker-identification

Python

113

33

6 年前

Appen / UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

speech-processing speech-recognition speaker-diarization speaker-identification

Forth

107

19

3 年前

FAKEBOB-adversarial-attack / FAKEBOB

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

adversarial-attacks speaker-identification speaker-verification

Python

104

29

3 年前

funcwj / ge2e-speaker-verification

Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"

speaker-verification PyTorch speaker-identification

Python

103

25

7 年前

cvqluu / GE2E-Loss

Pytorch implementation of Generalized End-to-End Loss for speaker verification

speaker-verification PyTorch speaker-identification speaker-diarization speaker-recognition

Python

86

16

6 年前

nezhar / speech-condenser

A tool for summarizing dialogues from videos or audio

asr speaker-diarization speaker-identification summarization

Python

83

10

2 年前