avx-512

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Alibaba Tair, Redpanda, YDB and StarRocks

C
1693
10 hours ago
simdutf/simdutf

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.

C++
1520
8 days ago

Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).

C++
510
3 months ago

Intel Homomorphic Encryption Acceleration Library accelerates modular arithmetic operations used in homomorphic encryption by leveraging the AVX512 and AVX512-IFMA52 instructions available on Intel's 3rd Generation Xeon Scalable Processors and later.

C++
247
3 months ago

A curated list of awesome SIMD frameworks, libraries and software

207
1 year ago

A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).

C
91
2 years ago

Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and AVX-512.

C++
67
1 year ago

MASM-compatible assembler

Assembly
61
5 hours ago

AVX-512 documentation beyond what Intel provides

56
2 years ago

The fastest Run-Length-Encoding on the Planet (for x64)

C
38
7 months ago

Algorithms for matrix-matrix multiplication (dgemm) with AVX-256 and AVX-512

C++
20
9 months ago

AVX-Hole C++ SIMD Library

C++
16
2 months ago

A generic and efficient SIMD implementation of MSB radix sort, written in C++, with separate key and payload data streams and support for arbitrary key and payload data types. Accompanied by a bachelor's thesis.

C++
15
8 months ago

DedupBench is a benchmarking tool for data chunking techniques used in data deduplication. DedupBench is designed for extensibility, allowing new chunking techniques to be implemented with minimal additional code.

C++
13
2 months ago

Benchmark to show which is the fastest memcpy.

Assembly
12
5 years ago

Vector Dossier is a CLI tool that statically analyzes the vectorization depth of programs and libraries.

Jupyter Notebook
10
8 months ago

Utility that was used to generate the initial Go AVX-512 encoder test suite.

Assembly
9
6 years ago

Running GPGPU-like kernels on the CPU with auto-vectorization for SSE/AVX/AVX512 SIMD architectures

C++
9
2 years ago