Repository navigation

#

speechrecognition

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

HTML
365
4 个月前

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Tcl
269
1 年前

It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra features...

Python
132
2 年前

A pytorch based end2end speech recognition system.

Python
113
4 年前

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Python
89
4 年前

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Python
86
3 年前

🙊 Speech Recognition , Text To Speech , Google Translate

Java
81
2 年前

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

Python
61
1 年前

Open source projects related to Snips https://snips.ai/.

JavaScript
54
2 年前

the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.

Shell
50
4 年前

Pytorch based phoneme recognition (TIMIT phoneme classification)

Python
34
7 年前

PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE

Python
28
1 年前