Repository navigation

#

deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++
26241
8 个月前
Jupyter Notebook
9273
1 个月前

Examples of how to use or integrate DeepSpeech

Python
844
2 年前

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

Python
726
4 个月前

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Python
664
3 天前

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

Python
596
1 年前

DeepSpeech based forced alignment tool

Python
237
4 年前

A testing server for a speech to text service based on coqui.ai

Python
215
3 年前

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Go
182
3 年前

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Python
102
5 年前

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

Java
99
3 年前

Install Mozilla DeepSpeech on a Raspberry Pi 4

99
4 年前

Tooling for producing Italian model (public release available) for DeepSpeech and text corpus

Python
94
3 年前

Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech

Jupyter Notebook
92
2 年前

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

JavaScript
87
2 个月前

A MXNet implementation of Baidu's DeepSpeech architecture

Python
83
7 年前