cosine-similarity

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Java
2711
3 years ago

FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)

C
1150
1 year ago
Jupyter Notebook
546
8 years ago

Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

Go
521
6 days ago
Jupyter Notebook
490
1 year ago

Python package to accelerate the sparse matrix multiplication and top-n similarity selection

C++
404
19 days ago

Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine

Java
396
1 year ago

📈 This repo contains detailed notes and multiple projects implemented in Python related to AI and Finance. Follow the blog here: https://purvasingh.medium.com

Jupyter Notebook
364
4 years ago

A Python project for checking documents for plagiarism based on cosine similarity

Python
314
8 months ago

Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.

Scala
301
13 days ago

Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.

TypeScript
223
4 months ago

A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.

JavaScript
217
8 months ago

Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.

Go
183
1 month ago

Real-time face recognition using SCRFD, ArcFace, ByteTrack, and a similarity measure

Python
159
6 months ago

Information Retrieval algorithms developed in Python. To follow the blog posts, click on the link:

Jupyter Notebook
131
4 years ago