aws-samples / data-discovery-using-glue-comprehend
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
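
The two steps described above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is a minimal illustration, not the report tool's actual implementation: the function names `clean` and `duplicated_blocks` are hypothetical, and the cleaning rules assume Python-style `#` comments and `import`/`from` statements.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Drop empty lines, '#' comment lines, and import statements (assumed rules)."""
    kept = []
    for raw in source.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith(("import ", "from ")):
            continue
        kept.append(line)
    return kept


def duplicated_blocks(a: list[str], b: list[str], min_block: int = MIN_BLOCK):
    """Return (start_a, start_b, length) for each maximal identical run
    of at least min_block cleaned lines shared by a and b."""
    # Index every window of min_block consecutive lines in b.
    windows = defaultdict(list)
    for j in range(len(b) - min_block + 1):
        windows[tuple(b[j:j + min_block])].append(j)

    found = []
    for i in range(len(a) - min_block + 1):
        for j in windows.get(tuple(a[i:i + min_block]), []):
            # Skip matches that merely continue a run found one line earlier.
            if i > 0 and j > 0 and a[i - 1] == b[j - 1]:
                continue
            length = min_block
            while (i + length < len(a) and j + length < len(b)
                   and a[i + length] == b[j + length]):
                length += 1
            found.append((i, j, length))
    return found


file_a = "import os\n\n# setup\na = 1\nb = 2\nc = 3\nd = 4\ne = 5\nf = 6\ng = 7\n"
file_b = "import sys\nx = 0\na = 1\nb = 2\nc = 3\nd = 4\ne = 5\nf = 6\ny = 9\n"
blocks = duplicated_blocks(clean(file_a), clean(file_b))
print(blocks)
```

The returned indices are zero-based positions in the cleaned line lists, not in the original files, which is one reason a duplicate's size can be smaller than the raw line range it spans.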
Duplication Overall
  • 20% duplication:
    • 824 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 166 duplicated lines
  • 5 duplicates
system: 20% (166 lines)
Duplication per Extension
py: 32% (86 lines)
yml: 15% (80 lines)
Duplication per Component (primary)
scripts: 32% (86 lines)
CloudFormation-template: 15% (80 lines)
QuickSight: 0% (0 lines)

Duplication Between Components (50+ lines)

[Graph: CloudFormation-template and scripts, connected by an edge labeled 108 (duplicated lines).]

Longest Duplicates
The list of 5 longest duplicates.
Size  #  Folder                   File                                      Lines
54    2  CloudFormation-template  simplify-data-discovery-for-business-...  172:246 (10%)
         scripts                  trigger_glue_crawler.py                   5:77 (100%)
8     2  scripts                  Glue_Comprehend_Job.py                    71:78 (14%)
         scripts                  glue_comprehend_workflow_custom.py        84:91 (9%)
8     2  scripts                  Glue_Comprehend_Job.py                    11:18 (14%)
         scripts                  Inference_custom_entity_recognition.py    16:23 (19%)
7     2  CloudFormation-template  simplify-data-discovery-for-business-...  63:69 (1%)
         CloudFormation-template  simplify-data-discovery-for-business-...  262:268 (1%)
6     2  CloudFormation-template  simplify-data-discovery-for-business-...  252:257 (1%)
         CloudFormation-template  simplify-data-discovery-for-business-...  459:464 (1%)
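
The five duplicates listed above appear to account for all of the totals reported earlier. The sketch below re-derives those totals under one assumption that is not stated in the report: each copy of a duplicate contributes its full size to the component containing it. The variable names are illustrative only.

```python
# (size in cleaned lines, component of copy 1, component of copy 2),
# taken from the list of longest duplicates.
duplicates = [
    (54, "CloudFormation-template", "scripts"),
    (8, "scripts", "scripts"),
    (8, "scripts", "scripts"),
    (7, "CloudFormation-template", "CloudFormation-template"),
    (6, "CloudFormation-template", "CloudFormation-template"),
]
CLEANED_LINES = 824  # lines of cleaned code reported for the whole system

# Assumption: every copy counts its full size toward its own component.
per_component = {"CloudFormation-template": 0, "scripts": 0}
for size, comp_a, comp_b in duplicates:
    per_component[comp_a] += size
    per_component[comp_b] += size

total_duplicated = sum(per_component.values())
# Lines involved in duplicates whose two copies sit in different components.
between_components = sum(2 * size for size, a, b in duplicates if a != b)
overall_percent = round(100 * total_duplicated / CLEANED_LINES)

print(per_component)       # {'CloudFormation-template': 80, 'scripts': 86}
print(total_duplicated)    # 166
print(between_components)  # 108
print(overall_percent)     # 20
```

Under that assumption, the single 54-line duplicate between the CloudFormation template and `trigger_glue_crawler.py` contributes 108 of the 166 duplicated lines, so resolving it alone would remove most of the reported duplication.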