Repository navigation

#

ai-infra

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

3200
1 个月前

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda
684
7 天前

AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识

HTML
243
21 小时前

This is a landscape of the infrastructure that powers the generative AI ecosystem

HTML
148
10 个月前

hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda
32
1 个月前

TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.

Python
31
19 天前

Triton multi-level runner, include cubin, ptx, ttgir etc.

Python
24
3 天前

vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust

Rust
20
1 天前

A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.

Python
14
21 天前

OriginDL: A distributed deep learning framework Built from scratch

C++
10
1 个月前

This repository contains a list of various service-specific Azure Landing Zone implementation options.

10
4 个月前

Triton for OpenCL backend, and use mlir-translate to get source OpenCL code

MLIR
9
1 个月前

Memory Management Service, a Long Term Memory Solution for AI

Python
8
1 年前

TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.

Python
1
3 个月前

visualize ai omegacycle

HTML
1
7 个月前