duplicated block id: 1 size: 54 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (53:118)
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (53:118)

duplicated block id: 2 size: 42 cleaned lines of code in 2 files:
- mlebench/competitions/text-normalization-challenge-english-language/prepare.py (44:93)
- mlebench/competitions/text-normalization-challenge-russian-language/prepare.py (42:91)

duplicated block id: 3 size: 41 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (206:251)
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (206:251)

duplicated block id: 4 size: 31 cleaned lines of code in 2 files:
- mlebench/competitions/text-normalization-challenge-english-language/grade.py (7:49)
- mlebench/competitions/text-normalization-challenge-russian-language/grade.py (7:49)

duplicated block id: 5 size: 25 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (152:184)
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (152:184)

duplicated block id: 6 size: 19 cleaned lines of code in 2 files:
- mlebench/competitions/rsna-2022-cervical-spine-fracture-detection/config.yaml (3:21)
- mlebench/competitions/rsna-miccai-brain-tumor-radiogenomic-classification/config.yaml (3:21)

duplicated block id: 7 size: 19 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/grade.py (7:34)
- mlebench/competitions/herbarium-2021-fgvc8/grade.py (7:34)

duplicated block id: 8 size: 17 cleaned lines of code in 2 files:
- mlebench/competitions/text-normalization-challenge-russian-language/grade.py (18:41)
- mlebench/competitions/utils.py (197:220)

duplicated block id: 9 size: 17 cleaned lines of code in 2 files:
- mlebench/competitions/text-normalization-challenge-english-language/grade.py (18:41)
- mlebench/competitions/utils.py (197:220)

duplicated block id: 10 size: 16 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (152:170)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (139:159)

duplicated block id: 11 size: 16 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (152:170)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (139:159)

duplicated block id: 12 size: 15 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (63:79)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (56:72)

duplicated block id: 13 size: 15 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (63:80)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (56:72)

duplicated block id: 14 size: 14 cleaned lines of code in 2 files:
- mlebench/competitions/google-research-identify-contrails-reduce-global-warming/config.yaml (4:17)
- mlebench/competitions/h-and-m-personalized-fashion-recommendations/config.yaml (4:17)

duplicated block id: 15 size: 13 cleaned lines of code in 2 files:
- mlebench/competitions/random-acts-of-pizza/prepare.py (63:75)
- mlebench/competitions/statoil-iceberg-classifier-challenge/prepare.py (84:96)

duplicated block id: 16 size: 13 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (238:251)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (220:233)

duplicated block id: 17 size: 13 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (238:251)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (220:233)

duplicated block id: 18 size: 12 cleaned lines of code in 2 files:
- mlebench/competitions/text-normalization-challenge-english-language/prepare.py (23:36)
- mlebench/competitions/text-normalization-challenge-russian-language/prepare.py (21:34)

duplicated block id: 19 size: 12 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (106:118)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (84:96)

duplicated block id: 20 size: 12 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (106:118)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (84:96)

duplicated block id: 21 size: 12 cleaned lines of code in 2 files:
- mlebench/competitions/jigsaw-unintended-bias-in-toxicity-classification/config.yaml (14:25)
- mlebench/competitions/vesuvius-challenge-ink-detection/config.yaml (14:25)

duplicated block id: 22 size: 11 cleaned lines of code in 2 files:
- extras/kernels/download_kernel_references.py (58:72)
- extras/kernels/download_kernels.py (136:150)

duplicated block id: 23 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/jigsaw-unintended-bias-in-toxicity-classification/config.yaml (12:21)
- mlebench/competitions/rsna-breast-cancer-detection/config.yaml (12:21)

duplicated block id: 24 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (100:111)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (91:102)

duplicated block id: 25 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/jigsaw-unintended-bias-in-toxicity-classification/config.yaml (16:25)
- mlebench/competitions/siim-covid19-detection/config.yaml (16:25)

duplicated block id: 26 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (100:111)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (91:102)

duplicated block id: 27 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/siim-covid19-detection/config.yaml (16:25)
- mlebench/competitions/vesuvius-challenge-ink-detection/config.yaml (16:25)

duplicated block id: 28 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/grade.py (22:34)
- mlebench/competitions/herbarium-2022-fgvc9/grade.py (19:31)

duplicated block id: 29 size: 10 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/grade.py (22:34)
- mlebench/competitions/herbarium-2022-fgvc9/grade.py (19:31)

duplicated block id: 30 size: 9 cleaned lines of code in 2 files:
- agents/opendevin/start.py (107:116)
- agents/opendevin/start.py (128:137)

duplicated block id: 31 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/text-normalization-challenge-english-language/config.yaml (3:11)
- mlebench/competitions/text-normalization-challenge-russian-language/config.yaml (3:11)

duplicated block id: 32 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (225:235)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (207:217)

duplicated block id: 33 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/alaska2-image-steganalysis/config.yaml (3:11)
- mlebench/competitions/stanford-covid-vaccine/config.yaml (3:11)

duplicated block id: 34 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (206:214)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (241:250)

duplicated block id: 35 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/stanford-covid-vaccine/config.yaml (3:11)
- mlebench/competitions/text-normalization-challenge-russian-language/config.yaml (3:11)

duplicated block id: 36 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (225:235)
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (207:217)

duplicated block id: 37 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/alaska2-image-steganalysis/config.yaml (3:11)
- mlebench/competitions/text-normalization-challenge-english-language/config.yaml (3:11)

duplicated block id: 38 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/alaska2-image-steganalysis/config.yaml (3:11)
- mlebench/competitions/text-normalization-challenge-russian-language/config.yaml (3:11)

duplicated block id: 39 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/bms-molecular-translation/config.yaml (3:11)
- mlebench/competitions/statoil-iceberg-classifier-challenge/config.yaml (3:11)

duplicated block id: 40 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (3:11)
- mlebench/competitions/tensorflow2-question-answering/config.yaml (3:11)

duplicated block id: 41 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (193:202)
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (193:202)

duplicated block id: 42 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (206:214)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (241:250)

duplicated block id: 43 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/stanford-covid-vaccine/config.yaml (3:11)
- mlebench/competitions/text-normalization-challenge-english-language/config.yaml (3:11)

duplicated block id: 44 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (3:11)
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (3:11)

duplicated block id: 45 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (3:11)
- mlebench/competitions/tensorflow2-question-answering/config.yaml (3:11)

duplicated block id: 46 size: 9 cleaned lines of code in 2 files:
- mlebench/competitions/us-patent-phrase-to-phrase-matching/config.yaml (3:11)
- mlebench/competitions/uw-madison-gi-tract-image-segmentation/config.yaml (3:11)

duplicated block id: 47 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/3d-object-detection-for-autonomous-vehicles/config.yaml (4:11)
- mlebench/competitions/us-patent-phrase-to-phrase-matching/config.yaml (4:11)

duplicated block id: 48 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/alaska2-image-steganalysis/config.yaml (4:11)
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (4:11)

duplicated block id: 49 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (120:130)
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (120:130)

duplicated block id: 50 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/3d-object-detection-for-autonomous-vehicles/config.yaml (4:11)
- mlebench/competitions/uw-madison-gi-tract-image-segmentation/config.yaml (4:11)

duplicated block id: 51 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (4:11)
- mlebench/competitions/text-normalization-challenge-english-language/config.yaml (4:11)

duplicated block id: 52 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (4:11)
- mlebench/competitions/stanford-covid-vaccine/config.yaml (4:11)

duplicated block id: 53 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (193:200)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (155:162)

duplicated block id: 54 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/tensorflow2-question-answering/config.yaml (4:11)
- mlebench/competitions/text-normalization-challenge-russian-language/config.yaml (4:11)

duplicated block id: 55 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/rsna-miccai-brain-tumor-radiogenomic-classification/config.yaml (4:11)
- mlebench/competitions/seti-breakthrough-listen/config.yaml (4:11)

duplicated block id: 56 size: 8 cleaned lines of code in 2 files:
- extras/kernels/download_kernel_references.py (45:52)
- extras/kernels/download_kernels.py (129:136)

duplicated block id: 57 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/lmsys-chatbot-arena/config.yaml (3:10)
- mlebench/competitions/vesuvius-challenge-ink-detection/config.yaml (3:10)

duplicated block id: 58 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (4:11)
- mlebench/competitions/stanford-covid-vaccine/config.yaml (4:11)

duplicated block id: 59 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/stanford-covid-vaccine/config.yaml (4:11)
- mlebench/competitions/tensorflow2-question-answering/config.yaml (4:11)

duplicated block id: 60 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (4:11)
- mlebench/competitions/text-normalization-challenge-english-language/config.yaml (4:11)

duplicated block id: 61 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (193:200)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (155:162)

duplicated block id: 62 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/alaska2-image-steganalysis/config.yaml (4:11)
- mlebench/competitions/tensorflow2-question-answering/config.yaml (4:11)

duplicated block id: 63 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/rsna-2022-cervical-spine-fracture-detection/config.yaml (4:11)
- mlebench/competitions/seti-breakthrough-listen/config.yaml (4:11)

duplicated block id: 64 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/rsna-breast-cancer-detection/config.yaml (14:21)
- mlebench/competitions/vesuvius-challenge-ink-detection/config.yaml (14:21)

duplicated block id: 65 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/tensorflow2-question-answering/config.yaml (4:11)
- mlebench/competitions/text-normalization-challenge-english-language/config.yaml (4:11)

duplicated block id: 66 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/alaska2-image-steganalysis/config.yaml (4:11)
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (4:11)

duplicated block id: 67 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (4:11)
- mlebench/competitions/text-normalization-challenge-russian-language/config.yaml (4:11)

duplicated block id: 68 size: 8 cleaned lines of code in 2 files:
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (4:11)
- mlebench/competitions/text-normalization-challenge-russian-language/config.yaml (4:11)

duplicated block id: 69 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (152:159)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (125:132)

duplicated block id: 70 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/nfl-player-contact-detection/config.yaml (4:10)
- mlebench/competitions/tgs-salt-identification-challenge/config.yaml (4:10)

duplicated block id: 71 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/icecube-neutrinos-in-deep-ice/config.yaml (4:10)
- mlebench/competitions/jigsaw-toxic-comment-classification-challenge/config.yaml (4:10)

duplicated block id: 72 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (139:146)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (125:132)

duplicated block id: 73 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/grade.py (7:16)
- mlebench/competitions/herbarium-2022-fgvc9/grade.py (7:16)

duplicated block id: 74 size: 7 cleaned lines of code in 2 files:
- extras/kernels/download_kernel_references.py (37:43)
- extras/kernels/download_kernels.py (121:127)

duplicated block id: 75 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/leaf-classification/grade.py (35:44)
- mlebench/competitions/lmsys-chatbot-arena/grade.py (54:63)

duplicated block id: 76 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (152:159)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (125:132)

duplicated block id: 77 size: 7 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/grade.py (7:16)
- mlebench/competitions/herbarium-2022-fgvc9/grade.py (7:16)

duplicated block id: 78 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/tensorflow2-question-answering/prepare.py (86:92)
- mlebench/competitions/tensorflow2-question-answering/prepare.py (103:109)

duplicated block id: 79 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/jigsaw-unintended-bias-in-toxicity-classification/config.yaml (3:8)
- mlebench/competitions/petfinder-pawpularity-score/config.yaml (3:8)

duplicated block id: 80 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/rsna-breast-cancer-detection/prepare.py (11:16)
- mlebench/competitions/rsna-breast-cancer-detection/prepare.py (28:33)

duplicated block id: 81 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/dog-breed-identification/prepare.py (39:46)
- mlebench/competitions/kuzushiji-recognition/prepare.py (39:45)

duplicated block id: 82 size: 6 cleaned lines of code in 2 files:
- extras/kernels/download_kernel_references.py (29:35)
- extras/kernels/download_kernels.py (113:119)

duplicated block id: 83 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/jigsaw-unintended-bias-in-toxicity-classification/config.yaml (3:8)
- mlebench/competitions/learning-agency-lab-automated-essay-scoring-2/config.yaml (3:8)

duplicated block id: 84 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (84:89)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (97:102)

duplicated block id: 85 size: 6 cleaned lines of code in 2 files:
- agents/aide/config.yaml (21:26)
- agents/aide/config.yaml (133:138)

duplicated block id: 86 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/osic-pulmonary-fibrosis-progression/config.yaml (3:8)
- mlebench/competitions/siim-covid19-detection/config.yaml (3:8)

duplicated block id: 87 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/cdiscount-image-classification-challenge/config.yaml (3:8)
- mlebench/competitions/vinbigdata-chest-xray-abnormalities-detection/config.yaml (3:8)

duplicated block id: 88 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/jigsaw-unintended-bias-in-toxicity-classification/config.yaml (3:8)
- mlebench/competitions/tensorflow2-question-answering/config.yaml (3:8)

duplicated block id: 89 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2020-fgvc7/prepare.py (243:248)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (189:194)

duplicated block id: 90 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/rsna-breast-cancer-detection/config.yaml (16:21)
- mlebench/competitions/siim-covid19-detection/config.yaml (16:21)

duplicated block id: 91 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/AI4Code/config.yaml (3:8)
- mlebench/competitions/nfl-player-contact-detection/config.yaml (3:8)

duplicated block id: 92 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/aptos2019-blindness-detection/config.yaml (3:8)
- mlebench/competitions/vesuvius-challenge-ink-detection/config.yaml (3:8)

duplicated block id: 93 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2021-fgvc8/prepare.py (243:248)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (189:194)

duplicated block id: 94 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/aptos2019-blindness-detection/config.yaml (3:8)
- mlebench/competitions/lmsys-chatbot-arena/config.yaml (3:8)

duplicated block id: 95 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/google-quest-challenge/config.yaml (3:8)
- mlebench/competitions/rsna-breast-cancer-detection/config.yaml (3:8)

duplicated block id: 96 size: 6 cleaned lines of code in 2 files:
- mlebench/competitions/herbarium-2022-fgvc9/prepare.py (225:230)
- mlebench/competitions/inaturalist-2019-fgvc6/prepare.py (189:194)
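
Each record in this report has the same shape: a block id, a size in cleaned lines, a file count, and one "path (start:end)" entry per location. The following is a minimal Python sketch, not part of the tool that produced the report, for parsing that shape and totalling duplicated lines per file. The names parse_report and duplicated_lines_per_file are hypothetical, and the regular expressions assume only the wording visible in the records above; whitespace between fields may be spaces or newlines.

```python
import re
from collections import Counter

# Assumed header wording, copied from the report records:
#   duplicated block id: <n> size: <m> cleaned lines of code in <k> files:
RECORD = re.compile(
    r"duplicated block id:\s*(\d+)\s+size:\s*(\d+)\s+"
    r"cleaned lines of code\s+in\s+(\d+)\s+files:"
)
# Assumed location wording: "- <path> (<start>:<end>)".
# Hyphens inside paths are never followed by whitespace, so they do not match.
LOCATION = re.compile(r"-\s+(\S+)\s+\((\d+):(\d+)\)")


def parse_report(text):
    """Return one dict per duplicated block found in the report text."""
    blocks = []
    headers = list(RECORD.finditer(text))
    for i, header in enumerate(headers):
        # A record's body runs until the next header (or the end of the text).
        end = headers[i + 1].start() if i + 1 < len(headers) else len(text)
        body = text[header.end():end]
        locations = [
            (path, int(start), int(stop))
            for path, start, stop in LOCATION.findall(body)
        ]
        blocks.append(
            {
                "id": int(header.group(1)),
                "size": int(header.group(2)),
                "locations": locations,
            }
        )
    return blocks


def duplicated_lines_per_file(blocks):
    """Sum block sizes per file; overlapping blocks are counted more than once."""
    totals = Counter()
    for block in blocks:
        # A file that appears twice in one block is counted once for that block.
        for path in {path for path, _, _ in block["locations"]}:
            totals[path] += block["size"]
    return totals


if __name__ == "__main__":
    sample = (
        "duplicated block id: 30 size: 9 cleaned lines of code in 2 files: "
        "- agents/opendevin/start.py (107:116) "
        "- agents/opendevin/start.py (128:137)"
    )
    for path, lines in duplicated_lines_per_file(parse_report(sample)).most_common():
        print(lines, path)
```

Because many blocks overlap (for example, the herbarium prepare.py ranges recur across several block ids), the per-file totals are an upper bound on duplicated lines, useful for ranking files rather than as an exact count.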