llama-cpp

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

TypeScript
10959
1 year ago

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

C#
3123
3 days ago
Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Dart
1979
1 day ago
withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level.

TypeScript
1433
23 days ago
Go
1350
7 months ago
C++
487
1 month ago

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

C++
454
3 days ago
Rust
379
10 months ago

This repo is to showcase how you can run a model locally and offline, free of OpenAI dependencies.

Python
267
9 months ago

Run LLMs locally. A clojure wrapper for llama.cpp.

Clojure
162
22 days ago

Booster - open accelerator for LLM models. Better inference and debugging for AI hackers

C++
154
8 months ago

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

Go
148
2 days ago

Workbench for learning and practising AI tech in real scenarios on Android devices, powered by GGML (Georgi Gerganov Machine Learning) and FFmpeg

C++
136
2 months ago

LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.

Python
122
2 years ago

Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.

C++
109
1 year ago