Repository navigation
ai-infra
- Website
- Wikipedia
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
High performance distributed cache system. Built by Rust.
AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识
Transform your pythonic research to an artifact that engineers can deploy easily.
This is a landscape of the infrastructure that powers the generative AI ecosystem
Cloud Native ML/DL Platform
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.
vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust
A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.
OriginDL: A distributed deep learning framework Built from scratch
This repository contains a list of various service-specific Azure Landing Zone implementation options.
Triton for OpenCL backend, and use mlir-translate to get source OpenCL code
Memory Management Service, a Long Term Memory Solution for AI
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.
Decenteralized AI training platform for all