Repository navigation
retrieval
- Website
- Wikipedia
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
MTEB: Massive Text Embedding Benchmark
Apache Lucene.NET
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Study guides for MIT's 15.003 Data Science Tools
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Profile-Based Long-Term Memory for AI Applications
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
SGPT: GPT Sentence Embeddings for Semantic Search
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Epsilla is a high performance Vector Database Management System
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Parsing-free RAG supported by VLMs