aws-samples / sagemaker-distributed-training-pytorch-kr
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 27% duplication:
    • 1,423 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 385 duplicated lines
  • 10 duplicates
system27% (385 lines)
Duplication per Extension
py27% (385 lines)
Duplication per Component (primary)
train_code77% (325 lines)
src_dir6% (60 lines)
util0% (0 lines)
Longest Duplicates
The list of 10 longest duplicates.
See data for all 10 duplicates...
Size#FoldersFilesLinesCode
59 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
33:100 (34%)
47:115 (32%)
view
37 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
102:139 (21%)
117:158 (20%)
view
32 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
174:206 (18%)
209:247 (17%)
view
13 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
156:172 (7%)
183:199 (7%)
view
12 x 2 distributed_training/src_dir
distributed_training/src_dir
util.py
util.py
230:242 (6%)
263:275 (6%)
view
11 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
210:223 (6%)
259:272 (6%)
view
11 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
145:156 (6%)
166:178 (6%)
view
6 x 2 distributed_training/src_dir
distributed_training/src_dir
dis_util.py
dis_util.py
264:269 (2%)
275:280 (2%)
view
6 x 2 distributed_training/src_dir
distributed_training/src_dir
main_trainer.py
main_trainer.py
394:403 (1%)
508:515 (1%)
view
6 x 2 distributed_training/src_dir
distributed_training/src_dir
main_trainer.py
main_trainer.py
421:426 (1%)
532:537 (1%)
view
Duplicated Units
The list of top 2 duplicated units.
See data for all 2 unit duplicates...
Size#FoldersFilesLinesCode
20 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
0:0 
0:0 
view
16 x 2 distributed_training/train_code
distributed_training/train_code
pytorch_mnist.py
pytorch_mnist_smdp.py
0:0 
0:0 
view