#ai-infra

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

3312
2 months ago

A.I.G (AI-Infra-Guard) is a comprehensive, intelligent, and easy-to-use AI Red Teaming platform developed by Tencent Zhuque Lab.

Python
1798
7 days ago

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda
732
7 days ago

AI fundamentals: GPU architecture, CUDA programming, large-model basics, and AI Agent topics.

HTML
513
3 days ago

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python
456
4 days ago

This is a landscape of the infrastructure that powers the generative AI ecosystem

HTML
149
1 year ago

Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.

Python
70
3 days ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

49
2 days ago

LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model

Python
45
1 month ago

HPC tutorials covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more.

Cuda
39
18 days ago

TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.

Python
33
2 months ago

vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust

Rust
25
5 days ago

A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.

Python
19
2 months ago

Triton with an OpenCL backend; uses mlir-translate to emit OpenCL source code.

MLIR
13
1 month ago

OriginDL: A distributed deep learning framework built from scratch

C++
10
5 days ago

This repository contains a list of various service-specific Azure Landing Zone implementation options.

10
5 months ago

Memory Management Service, a Long Term Memory Solution for AI

Python
8
1 year ago