Repository navigation

#

instruction-following

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python
30121
1 年前

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python
8458
12 天前

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook
1838
11 天前

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

1137
5 个月前

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

1128
2 年前

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python
822
1 年前

A collection of ChatGPT and GPT-3.5 instruction-based prompts for generating and classifying text.

529
1 年前
Python
374
6 个月前
Python
345
1 个月前

[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models

Python
210
2 年前

[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Python
206
3 个月前

[ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing

Python
134
18 天前