aws-samples / aws-glue-athena-tcga
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 28% duplication:
    • 1,042 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 298 duplicated lines
  • 29 duplicates
system28% (298 lines)
Duplication per Extension
py83% (277 lines)
yaml3% (21 lines)
Duplication per Component (primary)
pysparksamples83% (277 lines)
Cloudformation-solution3% (21 lines)
rsamples0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
See data for all 29 duplicates...
Size#FoldersFilesLinesCode
42 x 2 pysparksamples
pysparksamples
clinical_job.py
mutation_job.py
97:154 (50%)
98:155 (50%)
view
33 x 2 pysparksamples
pysparksamples
clinical_job.py
mutation_job.py
40:91 (39%)
40:91 (39%)
view
33 x 2 pysparksamples
pysparksamples
copy_number_job.py
mutation_job.py
39:90 (41%)
40:91 (39%)
view
33 x 2 pysparksamples
pysparksamples
clinical_job.py
copy_number_job.py
40:91 (39%)
39:90 (41%)
view
20 x 2 pysparksamples
pysparksamples
copy_number_job.py
mutation_job.py
95:114 (25%)
98:117 (24%)
view
20 x 2 pysparksamples
pysparksamples
clinical_job.py
copy_number_job.py
97:116 (24%)
95:114 (25%)
view
17 x 2 pysparksamples
pysparksamples
copy_number_job.py
expression_job.py
95:111 (21%)
97:113 (19%)
view
17 x 2 pysparksamples
pysparksamples
clinical_job.py
expression_job.py
97:113 (20%)
97:113 (19%)
view
17 x 2 pysparksamples
pysparksamples
expression_job.py
mutation_job.py
97:113 (19%)
98:114 (20%)
view
13 x 2 pysparksamples
pysparksamples
clinical_job.py
copy_number_job.py
122:145 (15%)
114:138 (16%)
view
13 x 2 pysparksamples
pysparksamples
copy_number_job.py
mutation_job.py
114:138 (16%)
123:146 (15%)
view
12 x 2 pysparksamples
pysparksamples
clinical_job.py
expression_job.py
122:141 (14%)
116:134 (13%)
view
12 x 2 pysparksamples
pysparksamples
copy_number_job.py
expression_job.py
114:134 (15%)
116:134 (13%)
view
12 x 2 pysparksamples
pysparksamples
expression_job.py
mutation_job.py
116:134 (13%)
123:142 (14%)
view
11 x 2 pysparksamples
pysparksamples
clinical_job.py
expression_job.py
78:91 (13%)
74:87 (12%)
view
11 x 2 pysparksamples
pysparksamples
expression_job.py
mutation_job.py
74:87 (12%)
78:91 (13%)
view
11 x 2 pysparksamples
pysparksamples
copy_number_job.py
expression_job.py
77:90 (13%)
74:87 (12%)
view
8 x 2 pysparksamples
pysparksamples
expression_job.py
mutation_job.py
143:150 (9%)
67:74 (9%)
view
8 x 2 pysparksamples
pysparksamples
clinical_job.py
expression_job.py
67:74 (9%)
143:150 (9%)
view
8 x 2 pysparksamples
pysparksamples
copy_number_job.py
expression_job.py
66:73 (10%)
143:150 (9%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
7 x 4 pysparksamples
pysparksamples
pysparksamples
pysparksamples
expression_job.py
mutation_job.py
clinical_job.py
copy_number_job.py
0:0 
0:0 
0:0 
0:0 
view