Repository navigation

triton

Website
Wikipedia

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

llm-training triton finetuning gemma2 llama llama3 llms mistral phi3

Python

4880

309

3 天前

ELS-RD / kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

CUDA PyTorch triton transformer

Jupyter Notebook

1565

1 年前

thu-ml / SageAttention

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

attention 大语言模型 quantization CUDA triton video-generation mlsys

Cuda

1342

4 天前

TritonDataCenter / containerpilot

A service for autodiscovery and configuration of applications running in containers

containerpilot consul orchestration containers Docker joyent service-discovery triton

1133

134

2 年前

JonathanSalwan / Tigress_protection

Playing with the Tigress software protection. Break some of its protections and solve their reverse engineering challenges. Automatic deobfuscation using symbolic execution, taint analysis and LLVM.

deobfuscation triton symbolic-execution LLVM 逆向工程 taint-analysis

LLVM

826

143

1 年前

coderonion / awesome-llm-and-aigc

🚀🚀🚀A collection of some wesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.

ChatGPT gpt large-language-models 大语言模型 Awesome Lists llama aigc langchain datasets yolo triton CUDA vlm deepseek qwen mllm ai4science

662

5 天前

BobMcDear / attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

CUDA 深度学习机器学习 PyTorch triton openai openai-triton

Python

532

1 天前

FlagOpen / FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.

PyTorch triton

Python

493

1 天前

JafarAkhondali / acer-predator-turbo-and-rgb-keyboard-linux-module

Linux kernel module to support Turbo mode and RGB Keyboard for Acer Predator notebook series

turbo acer helios triton led Linux rgb Hacktoberfest

440

4 个月前

rkinas / triton-resources

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

CUDA triton

Python

336

1 个月前

d4em0n / exrop

Automatic ROPChain Generation

rop exploitdev ctf binary-exploitation 逆向工程 exploit-development pwn triton rop-gadgets rop-exploitation symbolic-execution

Python

285

5 年前

coderonion / awesome-cuda-and-hpc

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

CUDA cublas tensorrt Awesome Lists 大语言模型 gpu blas PyTorch hpc gemm llama cudnn triton tensorrt-llm cutlass mlir tvm deepseek ptx vlm

248

1 小时前

Colton1skees / Dna

LLVM based static binary analysis framework

analysis binary deobfuscation instruction-semantics lifter program-analysis triton LLVM llvm-ir static-analysis x86 x86-64

C++

239

17 天前

opendilab / DI-hpc

OpenDILab RL HPC OP Lib, including CUDA and Triton kernel

reinforcement-learning CUDA hpc lstm PyTorch triton

Python

226

10 个月前

SQLab / symgdb

SymGDB - symbolic execution plugin for gdb

gdb gdb-plugin symbolic-execution triton

Python

216

7 年前

kakaobrain / trident

A performance library for machine learning applications.

人工智能 Library performance triton 深度学习 Python PyTorch 机器学习

Python

184

2 年前

mmsaeed509 / bspwm-dots

Ozoz dotfiles for bspwm, i3WM

Arch Linux bspwm polybar rofi dotfiles i3wm Linux acer helios Neovim triton turbo neofetch

Shell

167

9 个月前

clearml / clearml-serving

ClearML - Model-Serving Orchestration and Repository Solution

机器学习 mlops DevOps 深度学习 Kubernetes 人工智能 model-serving serving triton triton-inference-server

Python

149

3 个月前

NVIDIA-ISAAC-ROS / isaac_ros_object_detection

NVIDIA-accelerated, deep learned model support for image space object detection

ros2 object-detection inference 深度学习 Nvidia triton 机器学习 tensorrt ros2-humble ros gpu jetson

C++

148

2 个月前

novioleo / Savior

(WIP)The deployment framework aims to provide a simple, lightweight, fast integrated, pipelined deployment framework for algorithm service that ensures reliability, high concurrency and scalability of services.

workflow 深度学习部署 triton rpa distributed

Python

137

4 年前