Repository navigation

#

qwen2-5

Local CLI Copilot, powered by Ollama. 💻🦙

Go
1447
6 个月前

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python
1223
3 天前

Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to potentially execute offline remote code execution without running any actual code on the victim's machine or thwart LLM-based fraud/moderation systems.

Python
186
6 个月前

GPU-accelerated Llama3.java inference in pure Java using TornadoVM.

Java
179
2 天前

A light llama-like llm inference framework based on the triton kernel.

Python
153
14 天前

Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.

Python
73
1 个月前

1st Place Solution for Eedi - Mining Misconceptions in Mathematics Kaggle Competition

Python
56
9 个月前

Hand-derived memory-efficient super lazy PyTorch VJPs for training LLMs on laptop, all using one op (bundled scaled matmuls).

Python
45
6 个月前

Java 23, SpringBoot 3.4.1 Examples using Deep Learning 4 Java & LangChain4J for Generative AI using ChatGPT LLM, RAG and other open source LLMs. Sentiment Analysis, Application Context based ChatBots. Custom Data Handling. LLMs - GPT 3.5 / 4o, Gemini Pro 1.5, Claude 3, Llama 3.1, Phi-3, Gemma 2, Falcon 3, Qwen 2.5, Mistral Nemo, Wizard Math

Java
33
9 个月前

Exploring Agno framework for building AI agents.

Python
25
7 个月前

Community-built Qwen AI Provider for Vercel AI SDK - Integrate Alibaba Cloud's Qwen models with Vercel's AI application framework

TypeScript
25
6 天前

Project Zephyrine: Your personal experimental glass cockpit for the world of ideas. Let's take flight with a modern, locally-run automaton, using accelerated thought to navigate the both digital aether and reality. skim the clouds of discovery.

HTML
22
14 天前

Silver Medal Solution for the Kaggle Competition: Eedi - Mining Misconceptions in Mathematics

Python
19
18 天前

Models: Deepseek R1 models, Llama3.2, Qwen2.5. Integrations: Ollama, Gradio. Supports Local LLM. Test and deploy the latest LLM models in the fastest and most efficient way

Python
16
8 个月前

grpo to train long form QA and instructions with long-form reward model

Python
15
3 个月前

FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech processing without necessitating dedicated long-speech training data.

Python
12
2 个月前