Repository navigation
mlsys
- Website
- Wikipedia
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Distributed RL System for LLM Reasoning
Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
FedScale is a scalable and extensible open-source federated learning (FL) platform.
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
A scalable & efficient active learning/data selection system for everyone.
🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
Optimal Sparse Decision Trees
Federated Learning Systems Paper List
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference