Repository navigation
#
large-large-models
- Website
- Wikipedia
FlashInfer: Kernel Library for LLM Serving
Cuda
3843
9 小时前
research done at Prof Hung's Lab at MBZUAI
Python
2
1 年前
FlashInfer: Kernel Library for LLM Serving
research done at Prof Hung's Lab at MBZUAI