#blackwell
SGLang is a fast serving framework for large language models and vision language models.
Python
17022
18 minutes ago
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate inference execution in a performant way.
C++
11370
20 minutes ago
Prebuilt DeepSpeed wheels for Windows with NVIDIA GPU support. Supports GTX 10 through RTX 50 series. Compiled with PyTorch 2.7 and 2.8 and CUDA 12.8.
1
1 day ago
Repository for the Campbells-Luggs-Blackwells family history website
HTML
0
3 years ago
Cuda
0
2 months ago