mlsys
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
Distributed RL System for LLM Reasoning
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
FedScale is a scalable and extensible open-source federated learning (FL) platform.
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
A curated collection of noteworthy MLSys bloggers (Algorithms/Systems)
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
A scalable & efficient active learning/data selection system for everyone.
📚FFPA(Split-D): Yet another Faster Flash Attention with O(1) GPU SRAM complexity for large headdim, 1.8x~3x↑🎉 faster than SDPA EA.
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
Optimal Sparse Decision Trees
Federated Learning Systems Paper List
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference