# speechllm
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Python
12114
5 days ago
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
Python
1255
5 months ago
SHALLOW, the first hallucination benchmark for ASR models
Python
0
3 months ago