Repository navigation

#

open-r1

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).

Python
7034
2 天前

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Python
53
2 天前

R1V, trained with AI feedback, answers open-ended visual questions.

Python
11
7 天前

xVerify: Efficient Answer Verifier for Large Language Model Evaluations

Python
0
2 天前