Repository navigation
#
openai-triton
- Website
- Wikipedia
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python
3529
4 小时前
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Python
1282
5 个月前
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
Python
568
8 天前
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
Python
145
8 小时前
Learn and experiment with new techniques and programming languages with a focus on ML
Jupyter Notebook
8
1 个月前