Repository navigation

#

wav2vec

s3prl/s3prl
Python
2448
2 个月前

A live speech recognition using Facebooks wav2vec 2.0 model.

Python
363
2 年前

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python
181
2 年前

Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf

Python
66
4 年前

Training scripts for Speech-To-Text models for Ukrainian language

Jupyter Notebook
38
2 年前

Wav2vec resources and models for Brazilian Portuguese

Jupyter Notebook
34
3 年前

Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer

Jupyter Notebook
29
4 年前

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Python
24
4 年前
Python
14
4 年前

A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.

Python
4
4 年前

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

Jupyter Notebook
3
7 个月前

Create high-resolution visually dubbed videos with DINet

Python
3
1 年前

Building a speaker identification & verification pipeline for Vietnamese voices 😪

Jupyter Notebook
3
4 年前

A repo to make installation and training of a wav2vec model easier

Python
2
5 年前

👩🏻‍💻 ( ͡❛ ‿●‿ ͡❛) wav2vec

2
4 年前

This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.

Shell
2
2 年前