qwen2-5
Local CLI Copilot, powered by Ollama. 💻🦙
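A minimal sketch of the pattern (not this repo's actual code): a Python CLI that sends a prompt to a local Ollama server over its REST API. The model name `llama3` and the default port are assumptions.

```python
# Hypothetical sketch of a CLI copilot backed by a local Ollama server.
# Assumes Ollama is running on its default port and "llama3" is pulled;
# neither detail comes from the repo itself.
import sys
import requests

def ask(prompt: str) -> str:
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={"model": "llama3", "prompt": prompt, "stream": False},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()["response"]

if __name__ == "__main__":
    print(ask(" ".join(sys.argv[1:]) or "Suggest a command to list open ports."))
```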
An open-source implementation for fine-tuning the Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Experimental tools to backdoor large language models by rewriting their system prompts at the raw parameter level. This can potentially enable offline remote code execution without running any actual code on the victim's machine, or be used to thwart LLM-based fraud/moderation systems.
GPU-accelerated Llama3.java inference in pure Java using TornadoVM.
A lightweight Llama-style LLM inference framework built on Triton kernels.
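For context, a Triton kernel is a Python function compiled for the GPU. A generic vector-add kernel, purely illustrative and unrelated to this repo's actual kernels:

```python
# Illustrative Triton vector-add kernel (generic example, not from this repo).
# Requires a CUDA GPU; tensors must live on the device.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)                      # which block am I?
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                      # guard the ragged tail
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    grid = lambda meta: (triton.cdiv(out.numel(), meta["BLOCK_SIZE"]),)
    add_kernel[grid](x, y, out, out.numel(), BLOCK_SIZE=1024)
    return out
```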
Hand-derived, memory-efficient, super-lazy PyTorch VJPs for training LLMs on a laptop, all using one op (bundled scaled matmuls).
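The underlying technique, hand-writing a VJP (vector-Jacobian product) rather than relying on autograd's traced graph, can be sketched with a custom `torch.autograd.Function` for a plain matmul; this is a generic illustration, not the repo's bundled scaled-matmul op:

```python
# Generic hand-derived VJP for C = A @ B, via a custom autograd Function:
# dL/dA = dL/dC @ B^T and dL/dB = A^T @ dL/dC.
import torch

class HandMatmul(torch.autograd.Function):
    @staticmethod
    def forward(ctx, a, b):
        ctx.save_for_backward(a, b)   # keep only what backward needs
        return a @ b

    @staticmethod
    def backward(ctx, grad_out):
        a, b = ctx.saved_tensors
        return grad_out @ b.T, a.T @ grad_out

a = torch.randn(4, 3, requires_grad=True)
b = torch.randn(3, 5, requires_grad=True)
HandMatmul.apply(a, b).sum().backward()  # fills a.grad and b.grad
```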
Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.
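"OpenAI-compatible" means a stock OpenAI client works once pointed at the deployment's base URL. A hedged sketch; the endpoint, key, and model name below are placeholders, not values from this project:

```python
# Placeholder endpoint, key, and model name; substitute whatever your
# deployment actually exposes.
from openai import OpenAI

client = OpenAI(base_url="http://your-endpoint.example.com/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="qwen2.5-7b-instruct",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```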
Java 23 and Spring Boot 3.4.1 examples using Deeplearning4j and LangChain4j for generative AI with the ChatGPT LLM, RAG, and other open-source LLMs. Covers sentiment analysis, application-context-based chatbots, and custom data handling. LLMs: GPT-3.5/4o, Gemini 1.5 Pro, Claude 3, Llama 3.1, Phi-3, Gemma 2, Falcon 3, Qwen 2.5, Mistral NeMo, WizardMath.
Exploring the Agno framework for building AI agents.
Community-built Qwen AI Provider for Vercel AI SDK - Integrate Alibaba Cloud's Qwen models with Vercel's AI application framework
Introducing Project Zephyrine: plug-and-play interaction with GPU acceleration in a modernized local graphical user interface.
Simple RAG system.
Silver Medal Solution for the Kaggle Competition: Eedi - Mining Misconceptions in Mathematics
Extract clothing items from an image.
Models: DeepSeek-R1, Llama 3.2, Qwen2.5. Integrations: Ollama, Gradio. Supports local LLMs. Test and deploy the latest LLM models quickly and efficiently.
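One plausible shape for an Ollama + Gradio integration (a sketch under assumptions, not this repo's code): wrap an `ollama.chat` call in a `gr.ChatInterface`. The model tag `qwen2.5` is assumed to be pulled locally.

```python
# Sketch only: assumes `pip install gradio ollama` and a running Ollama
# server with the "qwen2.5" model pulled. Chat history is ignored for brevity.
import gradio as gr
import ollama

def respond(message, history):
    reply = ollama.chat(model="qwen2.5",
                        messages=[{"role": "user", "content": message}])
    return reply["message"]["content"]

gr.ChatInterface(respond).launch()
```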
GRPO training for long-form QA and instruction following, using a long-form reward model.
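The core of GRPO is scoring each completion relative to a group of samples for the same prompt instead of against a learned value baseline. A minimal sketch of that advantage computation (generic, not this repo's training code):

```python
# Generic group-relative advantage computation, the step where GRPO replaces
# a value function; rewards would come from the long-form reward model.
import torch

def group_relative_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """rewards: (num_prompts, group_size); each completion's advantage is
    its reward standardized within its own group."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)

rewards = torch.tensor([[0.2, 0.9, 0.5, 0.4]])  # 4 sampled answers, 1 prompt
print(group_relative_advantages(rewards))
```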
FastLongSpeech is a novel framework that extends Large Speech-Language Models to efficient long-speech processing without requiring dedicated long-speech training data.
A framework for using local LLMs (Qwen2.5-Coder 7B), fine-tuned with RL, to generate, debug, and optimize code solutions through iterative refinement.
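The generate-execute-refine loop such a framework implies can be sketched generically; the Ollama endpoint, model tag, and prompts below are assumptions, not the repo's actual interface:

```python
# Hypothetical generate-execute-refine loop; the endpoint, model tag, and
# prompts are assumptions. Real code would also strip markdown fences from
# the model's output before executing it.
import subprocess, sys, tempfile
import requests

def generate(prompt: str) -> str:
    r = requests.post("http://localhost:11434/api/generate",
                      json={"model": "qwen2.5-coder:7b", "prompt": prompt,
                            "stream": False}, timeout=300)
    return r.json()["response"]

prompt = "Write a Python script that prints the 10th Fibonacci number."
for attempt in range(3):
    code = generate(prompt)
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(code)
        path = f.name
    result = subprocess.run([sys.executable, path], capture_output=True, text=True)
    if result.returncode == 0:
        break  # the candidate ran cleanly
    # Feed the traceback back for the next refinement round.
    prompt = f"This code failed with:\n{result.stderr}\nFix it:\n{code}"
```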