#tvm

Universal LLM Deployment Engine with ML Compilation

Python
20437
14 days ago

High-performance In-browser LLM Inference Engine

TypeScript
15256
3 months ago

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python
12219
1 day ago

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook
3655
1 year ago

TVM Documentation in Chinese Simplified / TVM 中文文档

TypeScript
1019
5 days ago

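AutoKernel is a simple, easy-to-use, low-barrier automatic operator optimization tool that improves the deployment efficiency of deep learning algorithms.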

C++
737
3 years ago

yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.

Python
729
20 days ago

🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered by web llm.

JavaScript
632
8 months ago

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Python
431
2 years ago

TON Foundation invites talent to imagine and realize projects that have the potential to integrate with the daily lives of users.

387
1 month ago

Open, Modular, Deep Learning Accelerator

Scala
285
1 year ago

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

249
1 day ago

Golang SDK for from Tonkeeper team

Go
243
2 days ago

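Knows more about engineering deployment than the algorithm people, and more about algorithms and models than the engineers.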

Jupyter Notebook
241
3 months ago

Optimizing Mobile Deep Learning on ARM GPU with TVM

C
181
7 years ago

A home for the final text of all TVM RFCs.

102
7 months ago

convert torch module to tensorrt network or tvm function

Python
88
5 years ago