lm_eval/tasks/med_concepts_qa/_med_concepts_qa.yaml (10 lines of code) (raw):

group: med_concepts_qa task: - med_concepts_qa_icd9cm - med_concepts_qa_icd10cm - med_concepts_qa_icd9proc - med_concepts_qa_icd10proc - med_concepts_qa_atc aggregate_metric_list: - metric: acc aggregation: mean