Repository navigation

#

rlhf

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python
37448
1 年前
ymcui/Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python
7175
1 个月前

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python
7027
1 个月前
huggingface/alignment-handbook

Robust recipes to align language models with human and AI preferences

Python
5322
22 天前

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook
4532
3 个月前

A curated list of reinforcement learning with human feedback resources (continually updated)

4100
1 个月前
Kiln-AI/Kiln
Python
4056
42 分钟前

Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

TypeScript
3813
3 小时前

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Python
3717
2 年前
argilla-io/distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python
2850
1 天前

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook
1838
11 天前

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python
1740
2 天前

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python
1606
5 个月前

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python
1501
7 个月前

Recipes to train reward model for RLHF.

Python
1438
4 个月前