aws-samples / aws-research-workshops
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
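The cleaning and matching steps described above can be sketched roughly as follows. This is a minimal illustration, not the report tool's actual implementation: the cleaning rules (Python-style `#` comments, `import`/`from` statements) and the names `clean` and `find_duplicates` are assumptions made for the example. Only the threshold of 6 lines comes from the report.

```python
from __future__ import annotations

import re
from collections import defaultdict

MIN_LINES = 6  # threshold stated in the report


def clean(source: str) -> list[tuple[int, str]]:
    """Return (original line number, stripped text) for lines kept after cleaning.

    Assumption: only empty lines, '#' comments, and import statements are
    dropped; the real tool's cleaning rules are not spelled out here.
    """
    kept = []
    for number, line in enumerate(source.splitlines(), start=1):
        text = line.strip()
        if not text or text.startswith("#"):
            continue
        if re.match(r"(import|from)\s", text):
            continue
        kept.append((number, text))
    return kept


def find_duplicates(files: dict[str, str], min_lines: int = MIN_LINES):
    """Group every window of `min_lines` cleaned lines by its exact content.

    Returns a dict mapping each repeated block (a tuple of lines) to the list
    of (file name, first line, last line) places where it occurs.
    """
    windows = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_lines + 1):
            chunk = lines[start:start + min_lines]
            block = tuple(text for _, text in chunk)
            windows[block].append((name, chunk[0][0], chunk[-1][0]))
    # Keep only blocks that occur in at least two places.
    return {block: places for block, places in windows.items() if len(places) > 1}
```

The sketch reports fixed-size windows only; it does not merge adjacent matching windows into one longer duplicate, which a real tool would likely do before reporting sizes such as "54 x 2".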
Duplication Overall
  • 8% duplication:
    • 5,299 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 456 duplicated lines
  • 66 duplicates
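As a quick check of the headline figure, the percentage can be recomputed from the two line counts above. The function name `duplication_percent` and the constant `TARGET_PERCENT` are hypothetical; the 5% target is the one stated in the intro.

```python
TARGET_PERCENT = 5.0  # target from the intro: keep duplication below 5%


def duplication_percent(duplicated_lines: int, cleaned_lines: int) -> float:
    """Share of cleaned lines that belong to a duplicate, in percent."""
    return 100.0 * duplicated_lines / cleaned_lines


percent = duplication_percent(456, 5299)
print(round(percent, 1))          # 8.6
print(percent > TARGET_PERCENT)   # True: above the 5% target
```

The exact value is about 8.6%, so the reported 8% is consistent with the figure being rounded down rather than rounded to the nearest integer.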
system: 8% (456 lines)
Duplication per Extension
yaml: 12% (236 lines)
html: 58% (108 lines)
py: 4% (96 lines)
yml: 2% (16 lines)
Duplication per Component (primary)
notebooks/building_data_lakes: 13% (199 lines)
notebooks/iot_greengress: 4% (76 lines)
notebooks/ml_tensorflow: 33% (62 lines)
notebooks/escience_series: 21% (62 lines)
notebooks/hpc: 35% (27 lines)
notebooks/serverless_apps: 3% (16 lines)
notebooks/parallelcluster: 4% (14 lines)
lib: 0% (0 lines)
ROOT: 0% (0 lines)
notebooks/intro_to_aws: 0% (0 lines)
notebooks/container: 0% (0 lines)
notebooks/genomics_pipeline: 0% (0 lines)
notebooks/spot: 0% (0 lines)
notebooks: 0% (0 lines)
notebooks/account-factory: 0% (0 lines)

Duplication Between Components (50+ lines)

notebooks/escience_series -- notebooks/ml_tensorflow: 124 lines
notebooks/building_data_lakes -- notebooks/serverless_apps: 53 lines

Longest Duplicates
The list of 20 longest duplicates.
Size x # | Folder | File | Lines
54 x 2 | notebooks/escience_series | input.html | 1:58 (100%)
       | notebooks/ml_tensorflow | input.html | 1:57 (100%)
12 x 2 | notebooks/building_data_lakes/cfn | sf_orch_glue.yaml | 52:63 (3%)
       | notebooks/building_data_lakes/cfn | sf_orch_glue.yaml | 83:94 (3%)
12 x 2 | notebooks/building_data_lakes/cfn | emr.yaml | 297:308 (3%)
       | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 24:35 (1%)
12 x 2 | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 220:232 (1%)
       | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 374:386 (1%)
10 x 2 | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 621:630 (1%)
       | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 685:694 (1%)
10 x 2 | notebooks/building_data_lakes/cfn | emr.yaml | 142:151 (3%)
       | notebooks/iot_greengress/health_tracker | greengrass-device.yaml | 121:130 (9%)
9 x 2 | notebooks/hpc | fastq-batch-roles.yaml | 8:16 (12%)
      | notebooks/hpc | fastq-batch-roles.yaml | 25:33 (12%)
9 x 2 | notebooks/hpc | fastq-batch-roles.yaml | 8:16 (12%)
      | notebooks/hpc | fastq-batch-roles.yaml | 50:58 (12%)
9 x 2 | notebooks/hpc | fastq-batch-roles.yaml | 25:33 (12%)
      | notebooks/hpc | fastq-batch-roles.yaml | 50:58 (12%)
9 x 2 | notebooks/iot_greengress...lth_tracker/tracker/ggd | heartbeat.py | 151:159 (8%)
      | notebooks/iot_greengress...lth_tracker/tracker/ggd | heartrate.py | 110:118 (9%)
9 x 2 | notebooks/iot_greengress...lth_tracker/tracker/ggd | heartrate.py | 70:79 (9%)
      | notebooks/iot_greengress...lth_tracker/tracker/ggd | utils.py | 70:79 (6%)
9 x 2 | notebooks/building_data_lakes/cfn | emr.yaml | 140:148 (2%)
      | notebooks/building_data_lakes/cfn | emr.yaml | 157:165 (2%)
8 x 2 | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 24:31 (1%)
      | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 51:58 (1%)
8 x 2 | notebooks/escience_series | mnist.py | 136:151 (8%)
      | notebooks/ml_tensorflow | mnist.py | 145:160 (8%)
8 x 2 | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 51:58 (1%)
      | notebooks/serverless_apps | code-pipeline.yml | 220:227 (3%)
8 x 2 | notebooks/building_data_lakes/cfn | emr.yaml | 297:304 (2%)
      | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 51:58 (1%)
8 x 2 | notebooks/serverless_apps | code-pipeline.yml | 154:161 (3%)
      | notebooks/serverless_apps | code-pipeline.yml | 220:227 (3%)
8 x 2 | notebooks/building_data_lakes/cfn | emr.yaml | 297:304 (2%)
      | notebooks/serverless_apps | code-pipeline.yml | 220:227 (3%)
8 x 2 | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 24:31 (1%)
      | notebooks/serverless_apps | code-pipeline.yml | 220:227 (3%)
8 x 2 | notebooks/building_data_lakes/cfn | kinesis-swiss-army.yaml | 51:58 (1%)
      | notebooks/serverless_apps | code-pipeline.yml | 154:161 (3%)
Duplicated Units
The single duplicated unit.
Size x # | Folder | File | Lines
11 x 2 | notebooks/ml_tensorflow | mnist.py | 0:0
       | notebooks/escience_series | mnist.py | 0:0