- Paper: C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
- Institution:
- Shanghai Jiao Tong University
- Tsinghua University
- University of Edinburgh
- Hong Kong University of Science and Technology
- arXiv: https://arxiv.org/abs/2305.08322
- GitHub: https://github.com/hkust-nlp/ceval
- Website: https://cevalbenchmark.com/
Evaluator | Metric | Description |
---|---|---|
CEvalEvaluator |
Accuracy | Multi-choice task |
Make sure you can access Hugging Face so that the dataset can be downloaded.
@inproceedings{huang2023ceval,
title={C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models},
author={Huang, Yuzhen and Bai, Yuzhuo and Zhu, Zhihao and Zhang, Junlei and Zhang, Jinghan and Su, Tangjun and Liu, Junteng and Lv, Chuancheng and Zhang, Yikai and Lei, Jiayi and Fu, Yao and Sun, Maosong and He, Junxian},
booktitle={Advances in Neural Information Processing Systems},
year={2023}
}