#preference-alignment

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python
913
6 months ago

[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

Python
194
1 year ago

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

Python
60
6 months ago

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

Python
47
10 months ago

Code for "ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment"

Python
40
18 days ago

Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).

Python
39
1 year ago

[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"

Python
17
1 year ago

[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Python
12
7 months ago

[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".

Python
9
21 days ago

[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Python
6
1 month ago

[ICML 25] "Preference Optimization for Combinatorial Optimization Problems"

Python
4
2 months ago

Survey of preference alignment algorithms

0
1 year ago

Generate synthetic datasets for instruction tuning and preference alignment using tools like `distilabel` for efficient and scalable data creation.

Jupyter Notebook
0
7 months ago