Repository navigation

#

speechllm

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
12899
4 天前

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.

Python
1471
13 天前

The Interspeech 2025 Multilingual Conversational Speech LLM (MLC-SLM) Challenge

8
6 个月前

TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks

Python
2
1 天前

SHALLOW, the first hallucination benchmark for ASR models

Python
0
4 个月前