instruction-following

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python
30165
1 year ago

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python
8465
2 months ago

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook
1868
2 months ago

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

1180
7 months ago

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

1130
2 years ago

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python
825
1 year ago

A collection of ChatGPT and GPT-3.5 instruction-based prompts for generating and classifying text.

533
2 years ago

Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)

Python
439
1 month ago

Python
377
7 months ago

Python
352
2 months ago

[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Python
235
19 days ago

[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models

Python
210
2 years ago

This framework works as a form of user/machine calibration, with a focus on user-context and user-intent, deconstructing your ideas logically from A to B to Z.

144
10 days ago