Repository navigation
#
gpt-evaluation
- Website
- Wikipedia
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
HTML
8222
10 个月前
Evaluating LLMs with CommonGen-Lite
Python
91
1 年前
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package builds upon the framework provided by the original FactScore repository, which is no longer maintained and contains outdated functions.
Python
13
1 年前