Repository navigation

#

matrix-multiplication

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C
911
3 天前

BLISlab: A Sandbox for Optimizing GEMM

C
540
4 年前

Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping.

Jupyter Notebook
505
7 天前

Multi-Threaded FP32 Matrix Multiplication on x86 CPUs

C
357
5 个月前

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers

Nim
290
2 年前

💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!

C
227
4 个月前

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

C++
209
5 个月前

Sparse matrix formats for linear algebra supporting scientific and machine learning applications

Go
165
4 年前

Accelerated General (FP32) Matrix Multiplication from scratch in CUDA

Cuda
159
9 个月前

DBCSR: Distributed Block Compressed Sparse Row matrix library

Fortran
144
5 天前
Assembly
111
1 天前