speechbrain
Extensions to YAML syntax for better Python interaction
Backend of an anti-fraud system based on speaker identification (voiceprint recognition) technology.
StutterFormer is an AI model that aims to take a speech sample containing stuttering disfluencies and return it with the disfluencies attenuated or eliminated.
Target speaker automatic speech recognition (TS-ASR)
Incremental learning for automatic speech recognition (ASR)
Record voice, transcribe a prompt, generate a picture from the prompt, create variations, get a description of a celebrity and upload; other use cases on KB
Real-Time Speaker Diarization (SpeechBrain ECAPA-TDNN) & Speech-to-Text Demo (Azure Speech SDK)
A Streamlit web app for speaker diarization and identification in audio files. Upload or record audio, transcribe conversations, and automatically segment and label speakers using reference samples. This app makes it easy to analyze multi-speaker audio, export transcripts, and identify "who spoke when" for meetings, interviews, and more.
Implementation of different curriculum learning (CL) methods for SpeechBrain's ASR recipes.
Processing EEG data using SpeechBrain-MOABB and tuning models to get the best results
Pretrained SpeechBrain wav2vec seq2seq+CTC model trained on the TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
Speaker verification of virtual assistants using the ECAPA-TDNN model from the SpeechBrain toolkit and a transfer learning approach, with emphasis on inter and intra comparison (text-independent and text-dependent).
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
This project is a Voice Identification System built using Python, leveraging SpeechBrain and ECAPA-TDNN for speaker verification. The system identifies users by comparing their voice embeddings with stored data, providing a secure and efficient method for user recognition.
A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
Speech transcription and speech diarization
Speech Emotion Recognition SE&R 2022
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.