#mlsys

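AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.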

Jupyter Notebook
13239
1 day ago

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

2871
8 months ago

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Cuda
1452
8 hours ago

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Cuda
1342
4 days ago
Python
1125
12 days ago

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda
453
3 days ago

A model compilation solution for various hardware

MLIR
424
3 days ago

FedScale is a scalable and extensible open-source federated learning (FL) platform.

Python
397
1 year ago

Measure and optimize the energy consumption of your AI applications!

Python
245
2 days ago

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

C
243
3 years ago

A collection of noteworthy MLSys bloggers (Algorithms/Systems)

HTML
223
3 months ago

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

C++
221
7 months ago

A scalable & efficient active learning/data selection system for everyone.

Python
214
9 months ago

📚FFPA(Split-D): Yet another Faster Flash Attention with O(1) GPU SRAM complexity for large headdim, 1.8x~3x↑🎉 faster than SDPA EA.

Cuda
168
13 days ago

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python
160
6 months ago

Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).

Jupyter Notebook
96
2 years ago
Python
64
4 months ago