lmms_eval/tasks/mmbench/mmbench.yaml (11 lines of code) (raw):
group: mmbench
task:
- mmbench_en_dev
- mmbench_en_test
- mmbench_cn_dev
- mmbench_cn_test
- mmbench_cn_cc
metadata:
  version: 0.0
  sys_prompt: "There are several options:"
  gpt_eval_model_name: "gpt-3.5-turbo-0613"