aws-samples / aws-sagemaker-byoc-end2end
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 4% duplication:
    • 694 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 34 duplicated lines
  • 2 duplicates
system4% (34 lines)
Duplication per Extension
py4% (34 lines)
Duplication per Component (primary)
pipelines3% (14 lines)
containers/batch_transform/resources25% (10 lines)
containers/training/resources15% (10 lines)
containers/serving/resources0% (0 lines)
processing0% (0 lines)
scripts0% (0 lines)
Longest Duplicates
The list of 2 longest duplicates.
See data for all 2 duplicates...
Size#FoldersFilesLinesCode
10 x 2 containers/batch_transform/resources
containers/training/resources
evaluate.py
train.py
13:24 (25%)
13:24 (15%)
view
7 x 2 pipelines
pipelines
byoc_pipeline.py
byoc_pipeline.py
239:245 (1%)
251:257 (1%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
8 x 2 containers/batch_transform/resources
containers/training/resources
evaluate.py
train.py
0:0 
0:0 
view