Repository navigation
#
gpt-evaluation
- Website
- Wikipedia
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
HTML
8126
6 个月前
Evaluating LLMs with CommonGen-Lite
Python
89
1 年前
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package builds upon the framework provided by the original FactScore repository, which is no longer maintained and contains outdated functions.
Python
11
1 年前