Repository navigation

#

avx-512

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Redpanda, YDB and StarRocks

C
1673
2 天前
simdutf/simdutf

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.

C++
1491
20 小时前

Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).

C++
507
2 个月前

Intel Homomorphic Encryption Acceleration Library accelerates modular arithmetic operations used in homomorphic encryption by leveraging AVX512 and IFM52 available on Intel's 3rd Generation Xeon Scalable Processors and later

C++
240
1 个月前

A curated list of awesome SIMD frameworks, libraries and software

201
1 年前

A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).

C
91
2 年前

Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and AVX-512.

C++
67
1 年前

Masm compatible assembler

Assembly
61
2 天前

AVX-512 documentation beyond what Intel provides

54
2 年前

The fastest Run-Length-Encoding on the Planet (for x64)

C
38
5 个月前

Algorithms for matrix matrix multiplication, dgemm, AVX-256, AVX-512

C++
19
7 个月前

AVX-Hole C++ SIMD Library

C++
16
25 天前

A generic and efficient SIMD implementation of MSB Radix Sort with separate key and payload datastreams that supports arbitrary key and payload data types written in C++ accompanied by a bachelor's thesis.

C++
15
7 个月前

Benchmark to show which is the fastest memcpy.

Assembly
11
5 年前

DedupBench is a benchmarking tool for data chunking techniques used in data deduplication. DedupBench is designed for extensibility, allowing new chunking techniques to be implemented with minimal additional code.

C++
10
6 天前

Vector Dossier is a CLI tool that statically analyzes vectorization depth of programs and libraries

Jupyter Notebook
9
7 个月前

Utility that was used to generate initial Go AVX-512 encoder test suite.

Assembly
9
6 年前

Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures

C++
9
2 年前