Repository navigation
document-retrieval
- Website
- Wikipedia
Open-source search and retrieval database for AI applications.
Distributed vector search for AI-native applications
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
Parsing-free RAG supported by VLMs
Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.
pgvector + embeddings API
Vietnamese long form question answering system with documents retrieval.
Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation
Retrieves the top 10 documents from the Wikipedia corpus for a user inputted free-text query
Document Querying with LLMs - Google PaLM API: Semantic Search With LLM Embeddings
ViDRILL is a Vietnamese document retrieval system for VLSP 2025. It combines dense and sparse retrieval, reranking, and optional LLM-based query rewriting and reasoning to support high-accuracy information retrieval and future LLM-enhanced pipelines.
Run text embeddings with Instructor-Large on AWS Lambda.
We address the task of learning contextualized word, sentence and document representations with a hierarchical language model by stacking Transformer-based encoders on a sentence level and subsequently on a document level and performing masked token prediction.
This project is a Document Retrieval application that utilizes Retrieval-Augmented Generation (RAG) techniques to enable users to interact with uploaded PDF documents. By leveraging a Large Language Model (LLM), users can ask questions about the content of the documents and receive accurate answers based on the information retrieved.
Built prediction and retrieval models for document retrieval, image retrieval, house price prediction, song recommendation, and analyzed sentiments using machine learning algorithms in Python
Client SDK for starpoint.ai
The Intelligent "ASKDOC" project combines the power of Langchain, Azure, OpenAI models, and Python to deliver an intelligent question-answering system, that scans your PDF documents and answer queries based on its contents. It can be queried using Human Natural Language.
Code and dataset for the paper "Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness"