Repository navigation

speechrecognition

Website
Wikipedia

A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model 深度学习

Python

10296

1542

7 天前

revdotcom / reverb

Open source inference code for Rev's model

speech-recognition speech-to-text asr canary Docker Whisper Open Source speechrecognition diarization huggingface speaker-diarization 深度学习神经网络

Python

421

4 个月前

speechbrain / speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

深度学习 speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speechrecognition 神经网络 neural-networks timit speech-analysis

HTML

370

2 个月前

SamirPaulb / real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

final-year-project 机器学习 speaker-recognition speech-to-text speechrecognition text-to-speech tkinter translation GUI Python

Tcl

316

2 年前

robmsmt / KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Keras deepspeech asr ctc coreml speechrecognition speech-to-text 深度学习机器学习 neural-networks baidu speech 神经网络

Python

242

7 年前

Azure-Samples / SpeechToText-WebSockets-Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Microsoft speech SDK JavaScript TypeScript ts browser WebSocket cognitive-services speech-recognition recognition speechrecognition

TypeScript

219

151

6 年前

roshan9419 / PersonalAssistantChatbot

It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra features...

聊天机器人 speechrecognition tkinter pyttsx3 OpenCV

Python

133

3 年前

by2101 / OpenASR

A pytorch based end2end speech recognition system.

speech speech-recognition speech-to-text speechrecognition transformer asr

Python

115

5 年前

shangeth / wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

深度学习 PyTorch speech-recognition audio-processing speech-processing speechrecognition representation-learning unsupervised-learning semi-supervised-learning voice-recognition speaker-recognition Hacktoberfest

Python

4 年前

Open-Speech-EkStep / vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

speech speechrecognition speech-recognition indic-languages PyTorch asr Open Source

Python

3 年前

goxr3plus / java-google-speech-api

🙊 Speech Recognition , Text To Speech , Google Translate

speechrecognition text-to-speech google-translate

Java

2 年前

solyarisoftware / WeBAD

Web Browser Audio Detection/Speech Recording Events API

audio audio-processing speechrecognition browser JavaScript volume volume-control microphone speech recording voice voice-recognition WebRTC Document Object Model (DOM)

JavaScript

3 年前

botbahlul / autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

Python speech-recognition voice-recognition captions FFmpeg subtitle speechrecognition

Python

1 年前

jindongwang / EasyEspnet

Making Espnet easier to use

speech-recognition speech asr speechrecognition toolkit easy-to-use

Python

4 年前

syntithenai / opensnips

Open source projects related to Snips https://snips.ai/.

speech speechrecognition rasa kaldi Docker dialog snowboy nlu asr

JavaScript

3 年前

IS2AI / ISSAI_SAIDA_Kazakh_ASR

the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.

speech-recognition speech-synthesis speech-to-text speechrecognition

Shell

4 年前