vector-similarity

RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for writing high-performance applications more easily.

Cuda
940
1 day ago

Embeddable, in-memory, document-oriented database with a high-level query builder interface.

C++
792
21 hours ago

Vector Hub: a library for easy discovery and consumption of state-of-the-art models that turn data into vectors (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.).

Python
561
1 year ago

Python and Java implementations of TS-SS, from the paper "A Hybrid Geometric Approach for Measuring Similarity Level Among Documents and Document Clustering".

Python
300
6 years ago

Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.

TypeScript
234
10 months ago

This repository contains various ways to calculate sentence vector similarity using NLP models.

Python
197
5 years ago

Timescale Vector Cookbook. A collection of recipes to build applications with LLMs using PostgreSQL and Timescale Vector.

Jupyter Notebook
122
9 months ago

AI GitHub assistant for your repo. Your proactive GitHub bot that auto-detects duplicates using OpenAI embeddings and Supabase magic!

TypeScript
34
9 months ago

Maximal Information Coefficient (MIC) Extension for Postgres

C
32
8 months ago

This code example shows how to make a chatbot for semantic search over documents using Streamlit, LangChain, and various vector databases. The chatbot lets users ask questions and get answers from a document collection. The code is in Python and can be customized for different scenarios and data.

Jupyter Notebook
15
2 years ago

Scripts for reading, extracting, and organizing data from HTML or PDF documents and preparing it for conversion into embeddings for use in context-augmented LLM queries.

Python
13
1 year ago

🧠 Leverage advanced AI embeddings to perform multilingual zero-shot text classification. Whether you're dealing with unlabelled data or classifying text against dynamic, user-defined labels, this library provides a seamless and efficient solution.

TypeScript
12
4 months ago

Fun with Game of Thrones word embeddings

Jupyter Notebook
11
8 years ago