Repository navigation

internvl

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Python
10191
2 days ago

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python
4912
5 days ago

A higher-performance OpenAI LLM service than vLLM serve: a pure C++ OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function calling, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.

Python
155
5 months ago

[ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Python
114
2 months ago

Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

Shell
0
5 months ago

ACL 2025, SDP workshop Shared Task Submission

Python
0
3 months ago

A handful of Ollama modelfiles intended for use with Home Assistant

0
11 days ago