llamacpp
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Unified framework for building enterprise RAG pipelines with small, specialized models
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Private & local AI personal knowledge management app for high entropy people.
State of the Art Natural Language Processing
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.
Simple, scalable AI model deployment on GPU clusters
A C#/.NET library to run LLMs (🦙LLaMA/LLaVA) on your local device efficiently.
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
Cross-platform framework for deploying LLM/VLM/TTS models locally on smartphones.
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.
Instant, controllable, local pre-trained AI models in Rust
Text-To-Speech, RAG, and LLMs. All local!