Repository navigation

#

kvcache

kvcache-ai/Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++
3784
4 小时前

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Python
1100
12 天前

kvcached: Elastic KV cache for dynamic GPU sharing and efficient multi-LLM inference.

Python
58
2 天前

(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation

Jupyter Notebook
32
3 个月前

PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]

Python
29
3 天前

Span Queries: What if we had a way to plan and optimize GenAI like we do for SQL?

Rust
5
16 小时前

This project implements an Emotion-Aware Music Generator (EAMG) that turns natural-language prompts into emotion-aligned music in real time. It uses a LoRA-tuned DistilBERT to classify emotions, maps them to musical parameters using music theory, and generates MIDI via a transformer model with KV caching for low-latency output.

Jupyter Notebook
0
1 个月前