#

string-similarity

rapidfuzz/RapidFuzz

Rapid fuzzy string matching in Python using various string metrics

Python
3036
8 days ago

Finds the degree of similarity between two strings, based on Dice's coefficient, which is mostly better than Levenshtein distance.

JavaScript
2528
2 years ago

Go metrics for calculating string similarity and other string utility functions

Go
375
1 month ago

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

C++
314
9 days ago

Rapid fuzzy string matching in C++ using the Levenshtein Distance

C++
291
2 months ago

A Tool for Measuring String Similarity

C
116
6 years ago

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

114
2 months ago

Record Linkage ToolKit (Find and link entities)

Python
110
2 years ago

Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.

Rust
108
2 years ago

Lightweight string similarity function for JavaScript

JavaScript
100
1 year ago

Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.

Go
87
5 years ago

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.

Scala
78
3 years ago

Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.

Java
73
1 year ago

Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity

Python
70
1 year ago

Beda is a Go library for detecting how similar two strings are

Go
54
4 years ago

String similarity based on Dice's coefficient in Go

Go
44
6 years ago

A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.

Java
43
3 years ago

Learning String Alignments for Entity Aliases

Python
37
6 years ago

Learned string similarity for entity names using optimal transport.

Python
35
4 years ago