kvcache
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++
3784
4 hours ago
Python
1100
12 days ago
kvcached: Elastic KV cache for dynamic GPU sharing and efficient multi-LLM inference.
Python
58
2 days ago
(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
Jupyter Notebook
32
3 months ago
PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]
Python
29
3 days ago
Span Queries: What if we had a way to plan and optimize GenAI like we do for SQL?
Rust
5
16 hours ago
This project implements an Emotion-Aware Music Generator (EAMG) that turns natural-language prompts into emotion-aligned music in real time. It uses a LoRA-tuned DistilBERT to classify emotions, maps them to musical parameters using music theory, and generates MIDI via a transformer model with KV caching for low-latency output.
Jupyter Notebook
0
1 month ago