Repository navigation

#

efficient-llm

[ICLR-2025-SLLM Spotlight 🔥]MobiLlama : Small Language Model tailored for edge devices

Python
661
5 个月前

[ICML 2024] CLLMs: Consistency Large Language Models

Python
403
1 年前

[NAACL' 25 main] Lillama: Large Language Model Compression via Low-Rank Feature Distillation

Python
11
6 个月前

There is a summary repo for Efficient AI direction. If you want to contribute to this repo, feel free to pr(pull request)!

6
1 年前

Colab-friendly BitNet distillation engine: collect KD traces from a teacher, train a ternary Mini-BitNet, and dry-run 7B memory. Multi-provider + Drive/S3

Python
0
1 个月前