duplicated block id: 1 size: 167 cleaned lines of code in 2 files: - notebooks/util/postproc/boxes.py (11:229) - pipeline/postprocessing/fn-postprocess/util/boxes.py (11:229) duplicated block id: 2 size: 39 cleaned lines of code in 2 files: - notebooks/util/postproc/deser.py (11:72) - pipeline/postprocessing/fn-postprocess/util/deser.py (11:72) duplicated block id: 3 size: 38 cleaned lines of code in 2 files: - notebooks/util/postproc/config.py (13:79) - pipeline/postprocessing/fn-postprocess/util/config.py (13:79) duplicated block id: 4 size: 17 cleaned lines of code in 2 files: - notebooks/annotation/ocr-bbox-and-validation.liquid.tpl.html (3:21) - notebooks/review/fields-validation.liquid.html (3:29) duplicated block id: 5 size: 15 cleaned lines of code in 2 files: - notebooks/util/smgt.py (31:45) - notebooks/util/smgt.py (49:63) duplicated block id: 6 size: 15 cleaned lines of code in 2 files: - notebooks/src/code/data/ner.py (169:183) - notebooks/src/code/data/ner.py (231:245) duplicated block id: 7 size: 12 cleaned lines of code in 2 files: - pipeline/__init__.py (77:88) - pipeline/__init__.py (107:118) duplicated block id: 8 size: 12 cleaned lines of code in 2 files: - pipeline/__init__.py (77:88) - pipeline/__init__.py (92:103) duplicated block id: 9 size: 12 cleaned lines of code in 2 files: - notebooks/src/code/data/geometry.py (159:170) - notebooks/src/code/data/ner.py (169:180) duplicated block id: 10 size: 12 cleaned lines of code in 2 files: - notebooks/src/code/data/geometry.py (159:170) - notebooks/src/code/data/ner.py (231:242) duplicated block id: 11 size: 12 cleaned lines of code in 2 files: - pipeline/__init__.py (92:103) - pipeline/__init__.py (107:118) duplicated block id: 12 size: 10 cleaned lines of code in 2 files: - notebooks/src/code/data/mlm.py (105:115) - notebooks/src/code/data/ner.py (148:158) duplicated block id: 13 size: 10 cleaned lines of code in 2 files: - notebooks/src/code/data/mlm.py (152:162) - notebooks/src/code/data/ner.py (307:329) duplicated block id: 14 size: 9 cleaned lines of code in 2 files: - cdk_demo_stack.py (64:72) - pipeline/__init__.py (92:100) duplicated block id: 15 size: 9 cleaned lines of code in 2 files: - cdk_demo_stack.py (64:72) - pipeline/__init__.py (77:85) duplicated block id: 16 size: 9 cleaned lines of code in 2 files: - cdk_demo_stack.py (64:72) - pipeline/__init__.py (107:115) duplicated block id: 17 size: 9 cleaned lines of code in 2 files: - notebooks/src/code/data/mlm.py (69:82) - notebooks/src/code/data/ner.py (80:93) duplicated block id: 18 size: 9 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (298:306) - pipeline/ocr/sfn_semaphore/__init__.py (573:606) duplicated block id: 19 size: 9 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (50:58) - pipeline/ocr/sfn_semaphore/__init__.py (186:194) duplicated block id: 20 size: 9 cleaned lines of code in 2 files: - notebooks/review/fields-validation.liquid.html (88:96) - notebooks/review/fields-validation.liquid.html (133:141) duplicated block id: 21 size: 8 cleaned lines of code in 2 files: - notebooks/src/code/data/base.py (162:169) - notebooks/src/code/data/mlm.py (147:154) duplicated block id: 22 size: 8 cleaned lines of code in 2 files: - notebooks/src/code/data/mlm.py (57:65) - notebooks/src/code/data/ner.py (65:73) duplicated block id: 23 size: 7 cleaned lines of code in 2 files: - pipeline/ocr/__init__.py (146:152) - pipeline/review/__init__.py (92:98) duplicated block id: 24 size: 7 cleaned lines of code in 2 files: - notebooks/src/code/train.py (70:76) - notebooks/src/code/train.py (79:85) duplicated block id: 25 size: 7 cleaned lines of code in 2 files: - pipeline/__init__.py (125:131) - pipeline/__init__.py (211:217) duplicated block id: 26 size: 7 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (66:72) - pipeline/ocr/sfn_semaphore/__init__.py (422:430) duplicated block id: 27 size: 7 cleaned lines of code in 2 files: - pipeline/iam_utils.py (72:89) - pipeline/iam_utils.py (105:122) duplicated block id: 28 size: 7 cleaned lines of code in 2 files: - notebooks/src/code/data/base.py (60:66) - notebooks/src/code/data/base.py (85:91) duplicated block id: 29 size: 6 cleaned lines of code in 2 files: - pipeline/enrichment/__init__.py (55:60) - pipeline/review/__init__.py (51:56) duplicated block id: 30 size: 6 cleaned lines of code in 2 files: - notebooks/src/code/train.py (51:56) - notebooks/src/code/train.py (60:65) duplicated block id: 31 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (50:55) - pipeline/ocr/sfn_semaphore/__init__.py (294:299) duplicated block id: 32 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (501:507) - pipeline/ocr/sfn_semaphore/__init__.py (652:660) duplicated block id: 33 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (169:177) - pipeline/ocr/sfn_semaphore/__init__.py (652:660) duplicated block id: 34 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (169:177) - pipeline/ocr/sfn_semaphore/__init__.py (501:507) duplicated block id: 35 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (74:79) - pipeline/ocr/sfn_semaphore/__init__.py (116:121) duplicated block id: 36 size: 6 cleaned lines of code in 2 files: - notebooks/src/code/data/mlm.py (49:54) - notebooks/src/code/data/ner.py (57:62) duplicated block id: 37 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/__init__.py (145:150) - pipeline/review/__init__.py (79:84) duplicated block id: 38 size: 6 cleaned lines of code in 2 files: - pipeline/ocr/sfn_semaphore/__init__.py (186:191) - pipeline/ocr/sfn_semaphore/__init__.py (294:299) duplicated block id: 39 size: 6 cleaned lines of code in 2 files: - pipeline/enrichment/__init__.py (78:84) - pipeline/postprocessing/__init__.py (184:190)