kvcache

kvcache-ai/Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++
4049
2 hours ago

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Python
1123
1 month ago
Python
98
2 days ago

PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]

Python
37
8 days ago

(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation

Jupyter Notebook
33
4 months ago

Span Queries: What if we had a way to plan and optimize GenAI like we do for SQL?

Rust
9
2 hours ago

This project implements an Emotion-Aware Music Generator (EAMG) that turns natural-language prompts into emotion-aligned music in real time. It uses a LoRA-tuned DistilBERT to classify emotions, maps them to musical parameters using music theory, and generates MIDI via a transformer model with KV caching for low-latency output.

Jupyter Notebook
0
3 months ago