Repository navigation

#

similarity-measures

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Java
2730
3 年前
ashvardanian/SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

C
1513
3 天前

📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

Go
574
2 个月前

Scalable Time Series Data Analytics

Java
320
5 个月前

中文智能客服机器人demo,包含闲聊和专业问答2个部分,支持自定义组件(Chinese intelligent customer chatbot Demo, including the gossip and the professional Q&A(FAQ) , support for custom components!)

Python
313
3 年前

Quantify the difference between two arbitrary curves in space

Jupyter Notebook
260
3 个月前

金融时间序列(预测分析 / 相似度 / 数据处理)

Jupyter Notebook
251
1 年前

Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.

Python
232
16 天前
Jupyter Notebook
205
4 年前

Information Theory and Distance Quantification with R

R
143
10 个月前

Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures

C++
133
5 年前

Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.

HTML
132
2 年前

vips-powered ruby gem to measure images similarity, implementing dHash and IDHash algorithms

Ruby
94
6 个月前
Rust
87
3 个月前

building a recommendation system using graph search methodologies. We will be comparing these different approaches and closely observe the limitations of each.

Python
68
9 年前

Formed trajectories of sets of points.Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms.Modeled trajectories as strings based on a Grid representation.Benchmarked KNN, Random Forest, Logistic Regression classification algorithms to classify efficiently trajectories.

Python
64
2 年前

Dynamic Time Warping single header library for C++

C++
54
5 年前

Romanian WordNet (Data + API for Python)

Python
52
5 年前