Repository navigation

rerank

Website
Wikipedia

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

llama rwkv 人工智能大语言模型 stable-diffusion API Kubernetes gpt4all tts musicgen mamba audio-generation image-generation text-generation gemma mistral llama3 rerank distributed libp2p

31887

2429

1 小时前

QuantumNous / new-api

AI模型接口管理与分发系统，支持将多种大模型转为统一格式调用，支持OpenAI、Claude等格式，可供个人或者企业内部管理与分发渠道使用，本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.

claude gemini openai rerank ai-gateway deepseek

6819

1351

16 小时前

mgonzs13 / llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

C++gpt llama 大语言模型 ros2 ggml gguf llamacpp llava vlm langchain embeddings rerank reranking

C++

196

5 天前

shell-nlp / gpt_server

gpt_server是一个用于生产级部署LLMs或Embedding的开源框架。

embedding gpt llama 大语言模型 openai prompt-injection rerank vllm tts fastchat function-calling asr

Python

167

3 天前

tensorlakeai / rerank-ts

rerank library for easy reranking of results

rerank reranking typescript-library

TypeScript

7 个月前

stephanj / BM25

A BM25 Java implementation using streams, stop words and stemming.

bm25 大语言模型自然语言处理 rerank

Java

1 年前

EliasPereirah / SearchAugmentedLLM

SearchAugmentedLLM empowers LLMs with information from the web

大语言模型 rag rerank reranker reranking retrieval-augmented-generation

PHP

2 个月前

bluechanel / deploy_llm

Rapid Deployment of LLM and Embedding Based on VLLM Using Docker

deploy Docker embedding 大语言模型 rerank vllm

Python

1 个月前

ittia-research / check

Automated fact-check

大语言模型 rag embedding rerank

Python

6 个月前

pashpashpash / python-rag-scaffold

A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.

模板 cohere cosine-similarity embeddings FastAPI observability openai Python rag rerank reranker retrieval-augmented-generation template uvicorn vector vector-database

Python

7 个月前

Lizhecheng02 / Kaggle-Eedi

Develop an nlp-based method to predict the affinity between misconceptions and incorrect answers (distractors) in multiple-choice questions.

cot embedding finetune 大语言模型 lora rerank vllm

Jupyter Notebook

4 个月前