Repository navigation

#

speech-recognition

huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python
148515
4 小时前
ggml-org/whisper.cpp
C++
42484
1 天前

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++
26568
2 个月前

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python
17364
2 个月前

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell
15057
1 个月前

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook
14453
1 年前

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Python
12168
6 天前

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
12098
5 天前

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python
8833
3 个月前

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python
8208
1 年前

💬 Speech recognition for your site

JavaScript
6664
1 年前

Facebook AI Research's Automatic Speech Recognition Toolkit

C++
6436
9 个月前