Repository navigation
ai-infra
- Website
- Wikipedia
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
Transform your pythonic research to an artifact that engineers can deploy easily.
This is a landscape of the infrastructure that powers the generative AI ecosystem
Cloud Native ML/DL Platform
AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识
vgpu.rs is the fractional GPU & vgpu-hypervisor implementation written in Rust
This repository contains a list of various service-specific Azure Landing Zone implementation options.
OriginDL: A distributed deep learning framework Built from scratch
Memory Management Service, a Long Term Memory Solution for AI
A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.
Persistent Naming for Mellanox Ethernet Interfaces
Decenteralized AI training platform for all
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks. DAG upgrade in progress.