Repository navigation

#

gguf

Local AI API Platform

C++
2761
2 个月前
Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Dart
2141
22 天前
Python
1991
16 天前

LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1,ollama, gemini, grok, qwen, GLM, deepseek, kimi,doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG

Python
1859
3 天前

动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/

Jupyter Notebook
1845
2 个月前
withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

TypeScript
1629
8 天前

Interface for OuteTTS models.

Python
1360
2 个月前

LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.

C
703
9 天前

Go library for embedded vector search and semantic embeddings using llama.cpp

Go
482
2 个月前

GGUF implementation in C as a library and a tools CLI program

C
282
7 个月前

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!

Python
275
1 天前

LM inference server implementation based on *.cpp.

C++
261
4 天前

The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes

Rust
229
14 天前

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

Go
198
2 天前

GPU-accelerated Llama3.java inference in pure Java using TornadoVM.

Java
161
15 天前