lmms_eval/tasks/mmbench/mmbench.yaml (11 lines of code) (raw):

group: mmbench
task:
  - mmbench_en_dev
  - mmbench_en_test
  - mmbench_cn_dev
  - mmbench_cn_test
  - mmbench_cn_cc
metadata:
  version: 0.0
  sys_prompt: "There are several options:"
  gpt_eval_model_name: "gpt-3.5-turbo-0613"