Repository navigation

#

jaro-winkler

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Java
2711
3 年前

🪼 a python library for doing approximate and phonetic matching of strings.

Jupyter Notebook
2121
13 天前

📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

Go
506
3 年前

🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).

Scala
485
8 年前

🔤 Rust implementations of string similarity metrics

Rust
435
1 年前

Go metrics for calculating string similarity and other string utility functions

Go
375
1 个月前

set of functions and operators for executing similarity queries

C
374
9 个月前

🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.

Rust
292
5 个月前

Ruby & C implementation of Jaro-Winkler distance algorithm which supports UTF-8 string.

Ruby
200
8 个月前

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..

Scala
78
3 年前

Ruby gem (native extension in Rust) providing implementations of various string metrics

Ruby
78
3 年前

Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity

Python
70
1 年前

String similarity metrics for Elixir

Elixir
62
4 年前

String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...

Go
61
8 年前

Spark functions to run popular phonetic and string matching algorithms

Scala
60
3 年前

PySpark phonetic and string matching algorithms

Python
39
1 年前

A collection of metrics and phonetic algorithms for fuzzy string matching in Elixir.

Elixir
35
2 年前

Fast batch jaro winkler distance implementation in C99 with Ruby, OCaml and Python bindings.

C
27
6 个月前

Distance related functions (Damerau-Levenshtein, Jaro-Winkler , longest common substring & subsequence) implemented as SQLite run-time loadable extension. Any UTF-8 strings are supported.

C
23
4 年前