Repository navigation

#

blackwell

Python
17022
18 分钟前

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.

C++
11370
20 分钟前

Prebuilt DeepSpeed wheels for Windows with NVIDIA GPU support. Supports GTX 10 - RTX 50 series. Compiled with pytorch 2.7, 2.8 and cuda 12.8

1
1 天前

Repository for Campbells-Luggs-Blackwells family history web site

HTML
0
3 年前

Pytorch Operation for distributed gemm in nvidia blackwell gpus

Cuda
0
2 个月前