Repository navigation

#

matrix-multiplication

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C
891
16 小时前

BLISlab: A Sandbox for Optimizing GEMM

C
534
4 年前

Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping.

Jupyter Notebook
484
1 个月前

Multi-Threaded FP32 Matrix Multiplication on x86 CPUs

C
351
4 个月前

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers

Nim
288
2 年前

💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!

C
225
2 个月前

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

C++
208
3 个月前

Sparse matrix formats for linear algebra supporting scientific and machine learning applications

Go
165
4 年前

DBCSR: Distributed Block Compressed Sparse Row matrix library

Fortran
145
1 天前

Accelerated General (FP32) Matrix Multiplication from scratch in CUDA

Cuda
122
7 个月前
Assembly
111
21 小时前