Repository navigation
avx-512
- Website
- Wikipedia
Performance-portable, length-agnostic SIMD with runtime dispatch
Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Redpanda, YDB and StarRocks
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.
Intel Homomorphic Encryption Acceleration Library accelerates modular arithmetic operations used in homomorphic encryption by leveraging AVX512 and IFM52 available on Intel's 3rd Generation Xeon Scalable Processors and later
A curated list of awesome SIMD frameworks, libraries and software
AVX-512 documentation beyond what Intel provides
(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.
The fastest Run-Length-Encoding on the Planet (for x64)
Algorithms for matrix matrix multiplication, dgemm, AVX-256, AVX-512
DedupBench is a benchmarking tool for data chunking techniques used in data deduplication. DedupBench is designed for extensibility, allowing new chunking techniques to be implemented with minimal additional code.
Vector Dossier is a CLI tool that statically analyzes vectorization depth of programs and libraries
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures