lmms_eval/tasks/textvqa/textvqa_val.yaml (12 lines of code) (raw):
task: textvqa_val
test_split: validation
metric_list:
  - metric: exact_match
    aggregation: mean
    higher_is_better: true
    ignore_case: true
    ignore_punctuation: true
  - metric: submission
    aggregation: !function utils.textvqa_aggreate_submissions
    higher_is_better: true
include: _default_template_textvqa_yaml