Repository navigation

#

ai-infra

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

2871
8 个月前

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda
453
3 天前

This is a landscape of the infrastructure that powers the generative AI ecosystem

HTML
142
6 个月前

AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识

Jupyter Notebook
120
4 天前

vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust

Rust
11
1 天前

This repository contains a list of various service-specific Azure Landing Zone implementation options.

10
3 个月前

OriginDL: A distributed deep learning framework Built from scratch

C++
10
7 天前

Memory Management Service, a Long Term Memory Solution for AI

Python
8
8 个月前

hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda
8
5 天前

A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.

Python
6
5 个月前

TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.

Python
6
4 天前

visualize ai omegacycle

HTML
1
3 个月前

Persistent Naming for Mellanox Ethernet Interfaces

Python
0
24 天前

TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.

Python
0
1 天前