lm_eval/tasks/med_concepts_qa/_med_concepts_qa.yaml (10 lines of code) (raw):
group: med_concepts_qa
task:
- med_concepts_qa_icd9cm
- med_concepts_qa_icd10cm
- med_concepts_qa_icd9proc
- med_concepts_qa_icd10proc
- med_concepts_qa_atc
aggregate_metric_list:
- metric: acc
  aggregation: mean