Repository navigation
retrieval
- Website
- Wikipedia
MTEB: Massive Text Embedding Benchmark
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Apache Lucene.NET
Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for chatbots, companions, tutors, customer service bots, and all chat-based agents.
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Study guides for MIT's 15.003 Data Science Tools
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
SGPT: GPT Sentence Embeddings for Semantic Search
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Epsilla is a high performance Vector Database Management System
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Parsing-free RAG supported by VLMs