wav2vec

Self-Supervised Speech Pre-training and Representation Learning Toolkit

speech-representation mockingjay representation-learning apc tera self-supervised-learning speech-pretraining vq-apc wav2vec hubert wavlm

Python

2372

498

1 month ago

mailong25 / self-supervised-speech-recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

speech-recognition self-supervised-learning wav2vec speech-to-text semi-supervised-learning unsupervised-learning

Python

384

117

3 years ago

oliverguhr / wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

speech-recognition pyaudio wav2vec speech-to-text asr speech

Python

350

1 year ago

arxyzan / data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

PyTorch self-supervised-learning fairseq roberta wav2vec huggingface beit

Python

177

2 years ago

shangeth / SpeakerProfiling

Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf

speech-processing lstm wav2vec cnn speech classification speaker-recognition speaker-verification audio-processing

Python

4 years ago

robinhad / voice-recognition-ua

Training scripts for speech-to-text models for the Ukrainian language

deepspeech speech-recognition coqui-ai stt speech-to-text asr wav2vec

Jupyter Notebook

2 years ago

lucasgris / wav2vec4bp

Wav2vec resources and models for Brazilian Portuguese

portuguese wav2vec automatic-speech-recognition dataset speech-to-text

Jupyter Notebook

3 years ago

loretoparisi / wave2vec-recognize-docker

Wav2vec 2.0 recognize pipeline

wav2vec Docker asr automatic-speech-recognition PyTorch wav2letter kenlm

Python

4 years ago

bhattbhavesh91 / wav2vec2-huggingface-demo

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework, using Hugging Face's Transformers

wav2vec speech-recognition speech-to-text unsupervised-learning self-supervised-learning speech-processing speech

Jupyter Notebook

4 years ago

daanzu / wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

speech-recognition speech-to-text speech Python PyTorch wav2vec

Python

4 years ago

notAI-tech / IndicASR

Speech recognition for Indic languages.

speech-recognition transformers PyTorch wav2vec asr speech-to-text

Python

4 years ago

jvel07 / wav2vec2_patho

Fine-tuning wav2vec2 for pathological speech processing

deep-learning emotion-recognition PyTorch sound-processing speech-processing speech-recognition wav2vec fine-tuning transformers

Jupyter Notebook

1 year ago

thisisHJLee / Fine-Tuning-of-XLSR-Wav2Vec2-on-Korean

avr stt wav2vec natural-language-processing transformers

Jupyter Notebook

2 years ago

phucpx268 / wav2asr

A library version of the wav2vec 2.0 framework for automatic speech recognition.

speech-recognition asr speech-to-text wav2vec

Python

4 years ago

slinusc / speaker_identification_evaluation

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

Whisper wav2vec

Jupyter Notebook

3 months ago

manhph2211 / DSP101

Building a speaker identification & verification pipeline for Vietnamese voices 😪

siamese-network contrastive-loss wav2vec torch librosa

Jupyter Notebook

3 years ago

NabinAdhikari674 / wav2vec

A repo to make installation and training of a wav2vec model easier

wav2vec

Python

4 years ago

oswaldoludwig / Pruning-pre-trained-models-using-evolutionary-computation

This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.

asr evolutionary-algorithms evolutionary-computation genetic-algorithm huggingface model-compression pre-trained-model pruning PyTorch transformer-models wav2vec

Shell

1 year ago

kimtth / huggingface-wav2vec

👩🏻‍💻 ( ͡❛ ‿●‿ ͡❛) wav2vec

huggingface wav2vec

4 years ago

Katashynskyi / Voice_assistant_UA_EN

No API keys | local | llama3.1 | for language study and live translation

chatbot language-classification llama large-language-model speech-recognition speech-to-text Streamlit voice-assistant wav2vec

Python

2 months ago