speechllm
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
Python
12899
4 days ago
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
Python
1471
13 days ago
The Interspeech 2025 Multilingual Conversational Speech LLM (MLC-SLM) Challenge
8
6 months ago
TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks
Python
2
1 day ago
SHALLOW, the first hallucination benchmark for ASR models
Python
0
4 months ago