aws-samples / amazon-sagemaker-bert-pytorch
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 35% duplication:
    • 315 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 112 duplicated lines
  • 3 duplicates
system35% (112 lines)
Duplication per Extension
py35% (112 lines)
Duplication per Component (primary)
code35% (112 lines)
Longest Duplicates
The list of 3 longest duplicates.
See data for all 3 duplicates...
Size#FoldersFilesLinesCode
30 x 2 code
code
deploy_ei.py
train_deploy.py
32:82 (56%)
246:293 (12%)
view
20 x 2 code
code
train_deploy.py
train_deploy.py
36:64 (8%)
76:104 (8%)
view
6 x 2 code
code
deploy_ei.py
train_deploy.py
11:18 (11%)
16:23 (2%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
29 x 2 code
code
deploy_ei.py
train_deploy.py
0:0 
0:0 
view