Repository navigation

retrieval

Website
Wikipedia

MTEB: Massive Text Embedding Benchmark

benchmark clustering information-retrieval sentence-transformers sts text-embedding retrieval neural-search semantic-search sbert text-classification reranking

Python

2876

475

11 小时前

chonkie-ai / chonkie

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

人工智能 chunking rag text-processing 自然语言处理 Python semantic-segmentation vector-search etl retrieval

Python

2872

125

6 个月前

VectifyAI / PageIndex

📄🧠 PageIndex: Document Index for Reasoning-based RAG

人工智能大语言模型 rag reasoning retrieval

Python

2648

204

15 天前

qdrant / fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

embeddings openai rag retrieval retrieval-augmented-generation vector-search

Python

2418

156

1 个月前

apache / lucenenet

Apache Lucene.NET

lucene text search information retrieval analysis index Query (disambiguation)apache Hacktoberfest

2337

647

7 小时前

memodb-io / memobase

Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for chatbots, companions, tutors, customer service bots, and all chat-based agents.

ChatGPT llm-application memory rag retrieval ai-companion ai-memory long-term-memory llm-memory

Python

2201

163

15 天前

intel / intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

大语言模型聊天机器人 4-bits llm-inference llm-cpu chatpdf streamingllm intel-optimized-llamacpp speculative-decoding habana rag retrieval

Python

2165

214

1 年前

beir-cellar / beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

自然语言处理 information-retrieval bert benchmark sentence-transformers retrieval elasticsearch sbert dataset colbert 深度学习 PyTorch 大语言模型 rag

Python

1970

220

4 个月前

shervinea / mit-15-003-data-science-tools

Study guides for MIT's 15.003 Data Science Tools

study-guide 数据科学 SQL R Git Bash manipulation visualization retrieval

1877

370

5 年前

parthsarthi03 / raptor

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

rag retrieval retrieval-augmented-generation clustering language-model 机器学习 vector-database agents 框架大语言模型

Python

1427

189

1 年前

superlinked / superlinked

Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

embeddings etl vector-search data-pipeline 深度学习 information-retrieval 大语言模型机器学习 mlops 自然语言处理 Python retrieval retrieval-augmented-generation semantic-search vectorization vector-database

Jupyter Notebook

1385

104

11 天前