#mmlu
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
Python
5685
1 year ago
A series of large language models developed by Baichuan Intelligent Technology
Python
4124
9 months ago
A 13B large language model developed by Baichuan Intelligent Technology
Python
2968
2 years ago
A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
120
3 months ago
[NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Python
22
9 months ago
AGI-Elo: How Far Are We From Mastering A Task?
Python
6
3 months ago
CLI tool to evaluate LLM factuality on MMLU benchmark.
Python
2
5 days ago
Performance analysis of LLMs on CPU and GPU: execution time and energy usage
Java
0
1 year ago