Repository navigation
#
gpt-evaluation
- Website
- Wikipedia
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
HTML
8232
1 年前
Evaluating LLMs with CommonGen-Lite
Python
91
2 年前
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package builds upon the framework provided by the original FactScore repository, which is no longer maintained and contains outdated functions.
Python
13
1 年前