#rerank

🤖 The free, open-source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

Go
34734
2 hours ago

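An AI model API management and distribution system that converts multiple large models into a unified calling format. It supports OpenAI, Claude and other formats, and can be used by individuals or within enterprises to manage and distribute channels. This project is a derivative of One API. 🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.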

Go
9852
1 day ago

gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR and TTS.

Python
205
2 days ago

rerank library for easy reranking of results

TypeScript
48
1 year ago

A BM25 Java implementation using streams, stop words and stemming.

Java
38
2 years ago

A FastAPI-based API for generating text embedding vectors. Handles Embedding+Rerank models and is compatible with the OpenAI and SiliconFlow formats.

HTML
24
7 days ago

Rapid Deployment of LLM and Embedding Based on VLLM Using Docker

Python
9
5 months ago

Python
8
10 months ago

SearchAugmentedLLM empowers LLMs with information from the web

PHP
8
6 months ago

A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.

Python
5
1 year ago

A chatbot built on LangGraph that integrates multiple LLMs (DeepSeek, Qwen, Zhipu AI) and supports online search and file parsing.

Python
5
3 months ago

A Python project that deploys a Local RAG chatbot using Ollama API and vLLM API. Refines answers with internal RAG knowledge base, using both Embedding and Rerank models to improve accuracy of context provided to LLM models.

Python
4
4 months ago

Develop an NLP-based method to predict the affinity between misconceptions and incorrect answers (distractors) in multiple-choice questions.

Jupyter Notebook
2
8 months ago

Go client for text-embeddings-inference (https://github.com/huggingface/text-embeddings-inference)

Go
1
3 months ago

A Python project that deploys a Local RAG chatbot using Ollama API and vLLM API. Refines answers with internal RAG knowledge base, using both Embedding and Rerank models to improve accuracy of context provided to LLM models.

Python
0
1 day ago

A Rerank RAG implementation written in Python.

Python
0
3 months ago