Repository navigation
#
ml-efficiency
- Website
- Wikipedia
Supercharge Your Model Training
Python
5341
3 天前
(Unofficial) building Hugging Face SmolLM-blazingly fast and small language model with PyTorch implementation of grouped query attention (GQA)
Python
1
3 个月前