Repository navigation

#

openai-triton

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python
3633
2 天前

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python
1287
6 个月前

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python
576
2 个月前

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

Python
148
4 天前

Learn and experiment with new techniques and programming languages with a focus on ML

Jupyter Notebook
9
1 个月前