Repository navigation

gpt-evaluation

Website
Wikipedia

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

bloom instruction-set llama open-models gpt-q instruct-gpt gpt-evaluation chinese-nlp lora instruct-finetune

HTML

8232

768

1 年前

allenai / CommonGen-Eval

Evaluating LLMs with CommonGen-Lite

ChatGPT evaluation gpt-evaluation llama2 大语言模型 llm-evaluation text-generation

Python

2 年前

armingh2000 / FactScoreLite

FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package builds upon the framework provided by the original FactScore repository, which is no longer maintained and contains outdated functions.

evaluation gpt-4 gpt-evaluation large-language-models llm-evaluation 大语言模型自然语言处理 openai question-answering

Python

1 年前