CUDA

CUDA® is a parallel computing platform and programming model developed by NVIDIA. With CUDA, developers can dramatically boost computing performance by harnessing the processing power of GPUs.
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.
Build and run Docker containers leveraging NVIDIA GPUs
Instant neural graphics primitives: lightning fast NeRF and more
kaldi-asr/kaldi is the official location of the Kaldi project.
Burn is a next-generation deep learning framework that doesn't compromise on flexibility, efficiency, or portability.
Open3D: A Modern Library for 3D Data Processing
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Containers for machine learning
A fast, scalable, high-performance library for gradient boosting on decision trees, used for ranking, classification, regression, and other machine learning tasks in Python, R, Java, and C++. Supports computation on CPU and GPU.
Samples for CUDA developers which demonstrate features in the CUDA Toolkit
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉