rerank

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

Go
35613
3 hours ago

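An AI model aggregation, management, and distribution system that converts multiple large models into a unified calling format. It supports OpenAI, Claude, Gemini, and other formats, and can be used by individuals or within enterprises to manage and distribute channels. 🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.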

JavaScript
11016
18 hours ago

gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.

Python
212
10 days ago

rerank library for easy reranking of results

TypeScript
50
1 year ago

A BM25 Java implementation using streams, stop words and stemming.

Java
39
2 years ago

A FastAPI-based API for generating text embedding vectors. It serves Embedding and Rerank models and is compatible with the OpenAI and SiliconFlow formats.

HTML
28
6 days ago

A corporate law RAG system with innovative retrieval and contextual strategies

Python
21
6 days ago

Rapid Deployment of LLM and Embedding Based on VLLM Using Docker

Python
9
7 months ago

SearchAugmentedLLM empowers LLMs with information from the web

PHP
8
7 months ago

Python
8
1 year ago

A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.

Python
5
1 year ago

A chatbot built on LangGraph that integrates multiple LLMs from DeepSeek, Qwen, and Zhipu AI, with support for online search and file parsing.

Python
5
5 months ago

CRoM (Context Rot Mitigation)-EfficientLLM is a Python toolkit designed to optimize the context provided to Large Language Models (LLMs). It provides a suite of tools to intelligently select, re-rank, and manage text chunks to fit within a model's context budget while maximizing relevance and minimizing performance drift.

Python
4
18 days ago

A Python project that deploys a local RAG chatbot using the Ollama API and the vLLM API. It refines answers with an internal RAG knowledge base, using both Embedding and Rerank models to improve the accuracy of the context provided to the LLM.

Python
4
5 months ago

Develop an NLP-based method to predict the affinity between misconceptions and incorrect answers (distractors) in multiple-choice questions.

Jupyter Notebook
2
10 months ago

Go client for text-embeddings-inference (https://github.com/huggingface/text-embeddings-inference)

Go
1
5 months ago