Repository navigation

#

cosine-similarity

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Java
2730
3 年前

FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)

C
1154
1 年前

Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

Go
726
2 个月前
Jupyter Notebook
547
8 年前
Jupyter Notebook
492
2 年前

Python package to accelerate the sparse matrix multiplication and top-n similarity selection

C++
415
6 天前

Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine

Java
406
2 年前

📈This repo contains detailed notes and multiple projects implemented in Python related to AI and Finance. Follow the blog here: https://purvasingh.medium.com

Jupyter Notebook
393
5 年前

A python project for checking plagiarism of documents based on cosine similarity

Python
325
1 年前

Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.

Scala
304
4 天前

Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.

TypeScript
234
10 个月前

A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.

JavaScript
219
1 年前

Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.

Go
184
6 个月前

Real-Time Face Recognition use SCRFD, ArcFace, ByteTrack and Similarity Measure

Python
177
1 年前

中文文本相似度计算器

Python
160
1 年前