Repository navigation

#

jaro-winkler

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Java
2726
3 年前

🪼 a python library for doing approximate and phonetic matching of strings.

Jupyter Notebook
2150
2 天前

📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

Go
523
3 小时前

🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).

Scala
485
8 年前

🔤 Rust implementations of string similarity metrics

Rust
445
1 年前

Go metrics for calculating string similarity and other string utility functions

Go
386
7 天前

set of functions and operators for executing similarity queries

C
384
3 个月前

🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.

Rust
296
9 个月前

Ruby & C implementation of Jaro-Winkler distance algorithm which supports UTF-8 string.

Ruby
201
3 个月前

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..

Scala
80
3 年前

Ruby gem (native extension in Rust) providing implementations of various string metrics

Ruby
77
3 年前

Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity

Python
73
2 年前

String similarity metrics for Elixir

Elixir
62
4 年前

String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...

Go
61
8 年前

Spark functions to run popular phonetic and string matching algorithms

Scala
60
3 年前

PySpark phonetic and string matching algorithms

Python
39
2 年前

A collection of metrics and phonetic algorithms for fuzzy string matching in Elixir.

Elixir
37
2 年前

Fast batch jaro winkler distance implementation in C99 with Ruby, OCaml and Python bindings.

C
28
10 个月前

Distance related functions (Damerau-Levenshtein, Jaro-Winkler , longest common substring & subsequence) implemented as SQLite run-time loadable extension. Any UTF-8 strings are supported.

C
24
4 年前