Repository navigation
machine-learning-systems
- Website
- Wikipedia
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
[TMLR 2024] Efficient Large Language Models: A Survey
Distributed RL System for LLM Reasoning
Infrastructures™ for Machine Learning Training/Inference in Production.
Curated collection of papers in machine learning systems
Learn how to design and implement effective Machine Learning systems from start to finish.
Dive into machine learning system, start from reinventing the wheel.
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Oort: Efficient Federated Learning via Guided Participant Selection
a curated list of high-quality papers on resource-efficient LLMs 🌱
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
Course Material for the UG Course COMP4901Y
Triton implement of bi-directional (non-causal) linear attention
Efficient Diffusion Models: A Survey
Machine Learning Compiler Road Map
CSCE 585 - Machine Learning Systems
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
A C++ implementation of the scalar-valued autograd engine micrograd
A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.