Repository navigation
#
on-device-llms
- Website
- Wikipedia
prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters
C++
473
4 天前
Multi-agent workflows with Llama3: A private on-device multi-agent framework
Python
3
1 年前