lm_eval/tasks/super_glue/boolq/seq2seq.yaml (26 lines of code) (raw):

tag: - super-glue-lm-eval-v1-seq2seq task: "boolq-seq2seq" dataset_path: super_glue dataset_name: boolq output_type: generate_until training_split: train validation_split: validation doc_to_text: "{{passage}}\nQuestion: {{question}}?\nAnswer:" doc_to_target: label doc_to_choice: [' no', ' yes'] target_delimiter: "" generation_kwargs: until: - "\n\n" - "\n" do_sample: false temperature: 0.0 metric_list: - metric: exact_match aggregation: mean higher_is_better: true ignore_case: true ignore_punctuation: true metadata: version: 0.0