Repository navigation
#
open-r1
- Website
- Wikipedia
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).
Python
7034
2 天前
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Python
53
2 天前
Python
11
7 天前
xVerify: Efficient Answer Verifier for Large Language Model Evaluations
Python
0
2 天前