GoogleCloudPlatform / document-ai-samples
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
Learn more...
Duplication Overall
system29% (21,671 lines)
dependency graphs: 2D graph | 3D graph | 3D graph (with duplicates)...
Duplication per Extension
ipynb34% (19,700 lines)
py13% (1,356 lines)
java47% (193 lines)
ts21% (150 lines)
js5% (90 lines)
yaml8% (68 lines)
css14% (63 lines)
html5% (31 lines)
tf5% (20 lines)
Duplication per Component (primary)
incubator-tools34% (19,079 lines)
document_ai_warehouse33% (1,245 lines)
community22% (352 lines)
web-app-demo16% (176 lines)
hitl-custom-review37% (129 lines)
uptraining_docai_processor_using_python12% (127 lines)
watermark-remover35% (90 lines)
document-processing-workflows6% (88 lines)
web-app-pix2info-python4% (81 lines)
paper_summarization7% (52 lines)
extract-tables75% (47 lines)
toolbox-batch-processing9% (46 lines)
document-json-explorer5% (45 lines)
ROOT8% (26 lines)
sql-pdf-python21% (26 lines)
extract-languages32% (16 lines)
tax-processing-pipeline-python1% (13 lines)
pdf-embedded-text9% (12 lines)
bq-connector<1% (12 lines)
pdf-splitter-python7% (9 lines)
classify-split-extract-workflow0% (0 lines)
cx-content-moderation0% (0 lines)
apps-script-google-drive0% (0 lines)
fraud-detection-python0% (0 lines)
filter-hitl-language0% (0 lines)

Duplication Between Components (50+ lines)

G document_ai_warehouse document_ai_warehouse incubator-tools incubator-tools document_ai_warehouse--incubator-tools 2261 paper_summarization paper_summarization document_ai_warehouse--paper_summarization 56 hitl-custom-review hitl-custom-review document_ai_warehouse--hitl-custom-review 108 incubator-tools--paper_summarization 1975 watermark-remover watermark-remover incubator-tools--watermark-remover 1706 uptraining_docai_processor_using_python uptraining_docai_processor_using_python incubator-tools--uptraining_docai_processor_using_python 1520 toolbox-batch-processing toolbox-batch-processing incubator-tools--toolbox-batch-processing 1266 hitl-custom-review--incubator-tools 1905 hitl-custom-review--watermark-remover 52 toolbox-batch-processing--uptraining_docai_processor_using_python 56 community community community--incubator-tools 118 extract-tables extract-tables community--extract-tables 139 web-app-pix2info-python web-app-pix2info-python community--web-app-pix2info-python 138 extract-languages extract-languages community--extract-languages 77

Download: SVG DOT (open online Graphviz editor)

Open 2D force graph... Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 50 longest duplicates.
See data for all 19,157 duplicates...
Size#FoldersFilesLinesCode
159 x 2 document_ai_warehouse/common/src/common/utils
document_ai_warehouse/do...house_processing_python
66:299 (58%)
41:274 (62%)
view
82 x 2 incubator-tools/ocr_upgradation_tool
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
610:691 (7%)
549:630 (7%)
view
76 x 2 incubator-tools/map_ocr_...rmation_to_cde_entities
incubator-tools/ocr_upgradation_tool
143:218 (13%)
265:340 (7%)
view
66 x 2 incubator-tools/ocr_upgradation_tool
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
270:335 (6%)
259:324 (6%)
view
66 x 2 incubator-tools/map_ocr_...rmation_to_cde_entities
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
148:213 (11%)
259:324 (6%)
view
64 x 2 incubator-tools/advance_table_line_enhancement
incubator-tools/advance_table_line_enhancement
184:247 (14%)
191:254 (15%)
view
52 x 2 incubator-tools/lineitem_improver_crosspage
incubator-tools/signature_detection
48:99 (11%)
48:99 (9%)
view
52 x 2 incubator-tools/lineitem_improver_crosspage
incubator-tools/lineitem...rover_using_column_data
48:99 (11%)
48:99 (11%)
view
52 x 2 incubator-tools/ocr_upgradation_tool
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
336:387 (4%)
207:258 (4%)
view
52 x 2 incubator-tools/lineitem...rover_using_column_data
incubator-tools/signature_detection
48:99 (11%)
48:99 (9%)
view
51 x 2 incubator-tools/tag_line...n_to_layout_text_blocks
incubator-tools/update_cde_schema
50:100 (12%)
50:100 (11%)
view
49 x 2 incubator-tools/bank_sta...nd_missing_items_finder
incubator-tools/map_ocr_...rmation_to_cde_entities
284:332 (4%)
275:323 (8%)
view
40 x 2 incubator-tools/character_box_removal
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
47:86 (9%)
56:95 (3%)
view
40 x 2 incubator-tools/bank_sta...nd_missing_items_finder
incubator-tools/specific_format_line_items_tagging
37:76 (3%)
52:91 (4%)
view
40 x 2 incubator-tools/image_segmentation
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
47:86 (10%)
56:95 (3%)
view
40 x 2 incubator-tools/image_segmentation
incubator-tools/lineitem...rover_using_column_data
47:86 (10%)
48:87 (9%)
view
40 x 2 incubator-tools/image_segmentation
incubator-tools/signature_detection
47:86 (10%)
48:87 (7%)
view
40 x 2 incubator-tools/character_box_removal
incubator-tools/signature_detection
47:86 (9%)
48:87 (7%)
view
40 x 2 incubator-tools/character_box_removal
incubator-tools/lineitem...rover_using_column_data
47:86 (9%)
48:87 (9%)
view
40 x 2 incubator-tools/image_segmentation
incubator-tools/lineitem_improver_crosspage
47:86 (10%)
48:87 (9%)
view
40 x 2 incubator-tools/advance_table_line_enhancement
incubator-tools/advance_table_line_enhancement
400:439 (9%)
383:422 (9%)
view
40 x 2 incubator-tools/character_box_removal
incubator-tools/lineitem_improver_crosspage
47:86 (9%)
48:87 (9%)
view
40 x 2 incubator-tools/lineitem_improver_crosspage
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
48:87 (9%)
56:95 (3%)
view
40 x 2 incubator-tools/lineitem...rover_using_column_data
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
48:87 (9%)
56:95 (3%)
view
40 x 2 incubator-tools/ocr_upgr...ol_using_enterprise_ocr
incubator-tools/signature_detection
56:95 (3%)
48:87 (7%)
view
40 x 2 incubator-tools/character_box_removal
incubator-tools/image_segmentation
47:86 (9%)
47:86 (10%)
view
40 x 2 document_ai_warehouse/common/src/common/utils
document_ai_warehouse/do...house_processing_python
303:361 (14%)
277:337 (15%)
view
39 x 2 incubator-tools/lineitem...rover_using_column_data
incubator-tools/signature_detection
404:442 (8%)
485:523 (7%)
view
39 x 2 document_ai_warehouse/do...house_processing_python
document_ai_warehouse/do...house_processing_python
1012:1050 (3%)
1120:1158 (3%)
view
39 x 2 incubator-tools/amount_r...n_from_words_to_numbers
incubator-tools/extending_entity_bounding_boxes
43:81 (5%)
41:79 (8%)
view
39 x 2 incubator-tools/add_vertices_to_entities
incubator-tools/map_ocr_...rmation_to_cde_entities
47:85 (10%)
48:86 (6%)
view
39 x 2 web-app-demo/Frontend/src/app
web-app-demo/Frontend/sr...nts/processor-selection
59:100 (47%)
73:114 (19%)
view
39 x 2 incubator-tools/add_vertices_to_entities
incubator-tools/convert_...se_to_documentai_format
47:85 (10%)
48:86 (8%)
view
39 x 2 incubator-tools/add_vertices_to_entities
incubator-tools/set_field_description_via_api
47:85 (10%)
50:88 (12%)
view
39 x 2 incubator-tools/convert_...se_to_documentai_format
incubator-tools/set_field_description_via_api
48:86 (8%)
50:88 (12%)
view
39 x 2 incubator-tools/image_segmentation
incubator-tools/lineitem...rover_using_column_data
328:366 (10%)
404:442 (8%)
view
39 x 2 incubator-tools/convert_...se_to_documentai_format
incubator-tools/map_ocr_...rmation_to_cde_entities
48:86 (8%)
48:86 (6%)
view
39 x 2 incubator-tools/tag_line...n_to_layout_text_blocks
incubator-tools/update_cde_schema
375:413 (9%)
403:441 (8%)
view
39 x 2 incubator-tools/image_segmentation
incubator-tools/signature_detection
328:366 (10%)
485:523 (7%)
view
39 x 2 incubator-tools/lineitem_improver_crosspage
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
400:438 (8%)
1040:1078 (3%)
view
39 x 2 incubator-tools/entity_label_restructuring_tool
incubator-tools/export_i..._document_schema_gemini
622:660 (5%)
655:693 (5%)
view
39 x 2 incubator-tools/map_ocr_...rmation_to_cde_entities
incubator-tools/set_field_description_via_api
48:86 (6%)
50:88 (12%)
view
38 x 2 incubator-tools/labeled_dataset_validation
incubator-tools/reorder_...based_on_unique_strings
372:409 (9%)
326:363 (10%)
view
38 x 2 incubator-tools/best-pra...ng_poor_performing_docs
incubator-tools/reorder_...based_on_unique_strings
564:601 (6%)
326:363 (10%)
view
38 x 2 incubator-tools/best-pra...ng_poor_performing_docs
incubator-tools/labeled_dataset_validation
564:601 (6%)
372:409 (9%)
view
37 x 2 incubator-tools/rename_entity_type
incubator-tools/reorder_...based_on_unique_strings
246:282 (13%)
327:363 (10%)
view
37 x 2 incubator-tools/labeled_dataset_validation
incubator-tools/rename_entity_type
373:409 (9%)
246:282 (13%)
view
37 x 2 incubator-tools/ocr_upgradation_tool
incubator-tools/ocr_upgr...ol_using_enterprise_ocr
386:422 (3%)
323:359 (3%)
view
37 x 2 incubator-tools/cer_wer
incubator-tools/language_detection
10:46 (12%)
10:46 (6%)
view
37 x 2 incubator-tools/best-pra...ng_poor_performing_docs
incubator-tools/rename_entity_type
565:601 (6%)
246:282 (13%)
view
Duplicated Units
The list of top 6 duplicated units.
See data for all 6 unit duplicates...
Size#FoldersFilesLinesCode
39 x 2 web-app-demo/Frontend/sr...nts/processor-selection
web-app-demo/Frontend/src/app
73:115 
59:101 
view
21 x 2 web-app-demo/Frontend/sr...nts/processor-selection
web-app-demo/Frontend/src/app
134:158 
107:131 
view
17 x 2 document_ai_warehouse/common/src/common/utils
document_ai_warehouse/do...house_processing_python
0:0 
0:0 
view
12 x 2 document_ai_warehouse/common/src/common/utils
document_ai_warehouse/do...house_processing_python
0:0 
0:0 
view
7 x 2 web-app-pix2info-python/src/frontend
community/identity-form-...ler-python/src/frontend
265:273 
340:348 
view
11 x 2 community/codelabs/docai-form-parser
extract-tables
0:0 
0:0 
view