aws / sagemaker-training-toolkit
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 11% duplication:
    • 2,060 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 244 duplicated lines
  • 17 duplicates
system11% (244 lines)
Duplication per Extension
py12% (206 lines)
yml66% (38 lines)
Duplication per Component (primary)
src/sagemaker_training12% (206 lines)
ROOT29% (38 lines)
src0% (0 lines)
src/sagemaker_training/cli0% (0 lines)
src/sagemaker_training/c0% (0 lines)
Longest Duplicates
The list of 17 longest duplicates.
See data for all 17 duplicates...
Size#FoldersFilesLinesCode
13 x 2 src/sagemaker_training
src/sagemaker_training
recordio.py
recordio.py
116:128 (11%)
155:167 (11%)
view
12 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
114:125 (6%)
38:49 (5%)
view
12 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
152:167 (6%)
76:91 (5%)
view
12 x 2 ROOT
ROOT
buildspec-release.yml
buildspec.yml
20:39 (41%)
18:36 (54%)
view
12 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
215:226 (6%)
136:147 (5%)
view
9 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
229:237 (5%)
151:159 (4%)
view
9 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
202:210 (5%)
127:135 (4%)
view
8 x 2 src/sagemaker_training
src/sagemaker_training
runner.py
runner.py
75:82 (12%)
92:99 (12%)
view
8 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
260:296 (4%)
284:321 (3%)
view
7 x 2 src/sagemaker_training
src/sagemaker_training
process.py
smdataparallel.py
287:293 (4%)
266:272 (3%)
view
7 x 2 ROOT
ROOT
buildspec-release.yml
buildspec.yml
1:9 (24%)
1:9 (31%)
view
7 x 2 src/sagemaker_training
src/sagemaker_training
process.py
smdataparallel.py
295:302 (4%)
274:280 (3%)
view
6 x 2 src/sagemaker_training
src/sagemaker_training
process.py
process.py
287:292 (3%)
295:300 (3%)
view
6 x 2 src/sagemaker_training
src/sagemaker_training
mpi.py
smdataparallel.py
194:199 (3%)
119:124 (2%)
view
6 x 2 src/sagemaker_training
src/sagemaker_training
smdataparallel.py
smdataparallel.py
266:271 (2%)
274:279 (2%)
view
6 x 2 src/sagemaker_training
src/sagemaker_training
process.py
smdataparallel.py
287:292 (3%)
274:279 (2%)
view
6 x 2 src/sagemaker_training
src/sagemaker_training
process.py
smdataparallel.py
295:300 (3%)
266:271 (2%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
8 x 2 src/sagemaker_training
src/sagemaker_training
environment.py
files.py
0:0 
0:0 
view