Repository navigation
cublas
- Website
- Wikipedia
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
Safe rust wrapper around CUDA toolkit
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
Algorithms implemented in CUDA + resources about GPGPU
Harness the power of GPU acceleration for fusing visual odometry and IMU data with an advanced Unscented Kalman Filter (UKF) implementation. Developed in C++ and utilizing CUDA, cuBLAS, and cuSOLVER, this system offers unparalleled real-time performance in state and covariance estimation for robotics and autonomous system applications.
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
code for benchmarking GPU performance based on cublasSgemm and cublasHgemm
bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码
Bandicoot: C++ library for GPU linear algebra & scientific computing - https://coot.sourceforge.io