Repository navigation

streaming-asr

Website
Wikipedia

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition voice-cloning vocoder voice-recognition self-supervised-learning Whisper

Python

12169

1930

6 天前

yeyupiaoling / PPASR

基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

asr paddlepaddle 深度学习 chinese speech-to-text speech speech-recognition streaming-asr conformer

Python

865

131

2 个月前

mllpresearch / Europarl-ASR

A 1300-hour English speech and text corpus of parliamentary debates for streaming ASR training and benchmarking, speech data filtering and speech data verbatimization.

automatic-speech-recognition streaming-asr

1 年前

gonsalet / ASR_and_MT_for_educational_parliamentary_and_broadcast_media

PhD Thesis: "Automatic speech recognition and machine translation with deep neural networks for open educational resources, parliamentary contents and broadcast media" (2024)

automatic-speech-recognition neural-machine-translation streaming-asr

8 个月前

nirnaim / faster-whisper-server

Faster-Whisper Transcription Server & API is a production-ready speech-to-text micro-service stack that wraps faster-whisper with a streaming FastAPI server, a Celery/Redis background queue, and optional Docker deployment—delivering real-time or batch audio transcription with minimal latency and simple web-hook integration.

人工智能 celery Docker FastAPI 机器学习 Python server-sent-events speech-recognition speech-to-text streaming-asr Whisper

Python

3 个月前