voice-recognition

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook

12989

1550

1 month ago

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition voice-cloning vocoder voice-recognition self-supervised-learning Whisper

Python

12169

1930

6 days ago

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model deep-learning

Python

10296

1542

7 days ago

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detection voice-recognition voice-commands PyTorch onnx voice-activity-detection voice-control onnx-runtime onnxruntime speech speech-processing vad

Python

6580

609

2 months ago

collabora / WhisperLive

A nearly-live implementation of OpenAI's Whisper.

dictation obs openai text-to-speech translation voice-recognition Whisper tensorrt tensorrt-llm whisper-tensorrt openvino

Python

3273

449

1 month ago

theajack / cnchar

🇨🇳 A full-featured Chinese character utility library (pinyin, strokes, radicals, idioms, speech, visualization, and more)

draw chinese-characters pinyin voice-recognition

TypeScript

2840

313

5 months ago

coqui-ai / STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

stt speech-to-text Tensorflow deep-learning automatic-speech-recognition asr voice-recognition speech-recognition

C++

2499

297

1 year ago

react-native-voice / voice

🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)

React Native Android iOS speech-recognition voice-recognition

TypeScript

2031

578

3 months ago

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

datasets dataset voice data voice-control voice-synthesis voice-commands voice-assistant voice-recognition voice-chat voice-activity-detection voice-conversion noise

1995

246

1 year ago

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-separation

1352

148

1 year ago

TEN-framework / ten-vad

Voice Activity Detector (VAD) from TEN: low-latency, high-performance and lightweight

conversational-ai real-time speech-processing vad voice-activity-detection voice-commands voice-recognition audio automatic-speech-recognition speech silero-vad

1321

111

9 days ago

yeyupiaoling / VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.

PyTorch voice-recognition arcface speaker-recognition

Python

1085

157

2 months ago