string-search

ashvardanian/StringZilla

Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and memory ops 🦖

C
2874
10 hours ago

Find parts of long text or data, allowing for some changes/typos.

Python
330
4 months ago

Aho-Corasick multi-keyword string searching library in PHP.

PHP
186
7 years ago

Wu-Manber text search/matching implementation in C#

C#
154
3 years ago

IDAPython plugin for finding function strings recursively

Python
132
3 years ago

Find multi-byte-encoded strings in binary data (Gitlab mirror).

Rust
125
1 year ago

Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.

Rust
108
3 years ago

Comparing performance-oriented string-processing libraries for substring search, multi-pattern matching, hashing, edit-distances, sketching, and sorting across CPUs and GPUs in Rust 🦀 and Python 🐍

Rust
91
1 day ago

Aho-Corasick for JavaScript.

JavaScript
83
1 year ago

A string library

Nim
81
3 years ago

Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.

Java
79
2 years ago

Aho-Corasick multi-string search for .NET and SQL Server.

C#
67
7 months ago

Twoway / Fast substring search for strings and byte strings (Rust) / Also assorted benchmarks and string search snippets

Rust
64
4 years ago

String matching algorithms for searching a single or multiple strings in large texts

Java
47
1 year ago

String matching algorithm benchmark

Java
37
1 year ago

📉 JavaScript Text Statistics that counts lines, words, chars, and spaces.

JavaScript
36
3 years ago

Implementation of Aho-Corasick string matching algorithm for .NET

C#
31
9 years ago

A high performance library to match discontinuous strings.

TypeScript
28
2 years ago

Collection of techniques for (binary) string searching

Rust
24
6 years ago