Repository navigation

#

tvm

Universal LLM Deployment Engine with ML Compilation

Python
21156
5 天前

High-performance In-browser LLM Inference Engine

TypeScript
16245
3 个月前

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python
12540
18 小时前

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook
3678
1 年前

TVM Documentation in Chinese Simplified / TVM 中文文档

TypeScript
2117
4 个月前

AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。

C++
741
3 年前

yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.

Python
730
8 天前

🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered by web llm.

JavaScript
633
1 年前

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Python
442
2 年前

TON Foundation invites talent to imagine and realize projects that have the potential to integrate with the daily lives of users.

420
1 个月前

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

311
18 天前

Open, Modular, Deep Learning Accelerator

Scala
303
1 年前

Golang SDK for from Tonkeeper team

Go
264
1 天前

比做算法的懂工程落地,比做工程的懂算法模型。

Jupyter Notebook
250
3 个月前

Optimizing Mobile Deep Learning on ARM GPU with TVM

C
181
7 年前

A home for the final text of all TVM RFCs.

105
1 年前

convert torch module to tensorrt network or tvm function

Python
89
6 年前