similarity-measures

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Java
2726
3 years ago
ashvardanian/SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for AVX2, AVX-512, NEON, SVE, & SVE2 📐

C
1453
3 days ago

📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.

Go
523
6 hours ago

Scalable Time Series Data Analytics

Java
318
4 months ago

Chinese intelligent customer service chatbot demo with two parts, chit-chat and professional Q&A (FAQ), with support for custom components.

Python
312
3 years ago

Quantify the difference between two arbitrary curves in space

Jupyter Notebook
259
2 months ago

Financial time series (predictive analysis / similarity / data processing)

Jupyter Notebook
245
1 year ago

Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.

Python
224
4 days ago
Jupyter Notebook
205
4 years ago

Information Theory and Distance Quantification with R

R
142
8 months ago

Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.

HTML
132
2 years ago

Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures

C++
131
4 years ago

vips-powered Ruby gem to measure image similarity, implementing dHash and IDHash algorithms

Ruby
93
4 months ago
Rust
88
1 month ago

Building a recommendation system using graph search methodologies. We will be comparing these different approaches and closely observing the limitations of each.

Python
67
9 years ago

Formed trajectories of sets of points. Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms. Modeled trajectories as strings based on a Grid representation. Benchmarked KNN, Random Forest, and Logistic Regression classification algorithms to classify trajectories efficiently.

Python
65
2 years ago

Romanian WordNet (Data + API for Python)

Python
52
5 years ago

Q&A chatbot based on text similarity (simple version)

Python
52
7 years ago