Repository navigation

#

speech-recognition

ggml-org/whisper.cpp
C++
39336
2 天前

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++
26241
7 个月前

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python
15039
7 天前

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell
14779
3 个月前

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook
14168
8 个月前

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Python
11796
3 天前

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
9853
5 天前
Jupyter Notebook
9273
1 个月前

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python
8691
3 天前

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python
8093
7 个月前

💬 Speech recognition for your site

JavaScript
6660
8 个月前

Facebook AI Research's Automatic Speech Recognition Toolkit

C++
6421
5 个月前