aws-samples / incremental-training-mlops
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • Duplication is measured by finding places in the code where 6 or more lines are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim to keep duplicated code as low as possible (<5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
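The cleaning and matching steps described above can be illustrated with a minimal Python sketch. It is not the analysis tool's actual implementation: the function names `clean` and `find_duplicates` are hypothetical, only `#` comments and `import`/`from` lines are treated as removable, and leading indentation is ignored when comparing lines, which the real tool may or may not do.

```python
import re
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Drop blank lines, '#' comment lines, and import statements.

    Assumption: this is a simplified stand-in for the report's cleaning step.
    """
    kept = []
    for raw in source.splitlines():
        line = raw.strip()  # assumption: indentation is ignored
        if not line or line.startswith("#"):
            continue
        if re.match(r"(import|from)\s", line):
            continue
        kept.append(line)
    return kept


def find_duplicates(files: dict[str, str], min_block: int = MIN_BLOCK):
    """Group (file name, cleaned-line index) pairs whose next
    min_block cleaned lines are identical."""
    windows = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for i in range(len(lines) - min_block + 1):
            windows[tuple(lines[i:i + min_block])].append((name, i))
    # A window seen at more than one location is a duplicate.
    return [locations for locations in windows.values() if len(locations) > 1]
```

Each returned group lists the locations of one repeated 6-line window. A duplicate longer than 6 lines shows up as several overlapping windows, so a fuller implementation would merge adjacent matches into a single longer block.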
Duplication Overall
  • 8% duplication:
    • 680 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 60 duplicated lines
  • 2 duplicates
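As a quick check of the overall figure, the arithmetic can be written out in Python. The helper name `duplication_percent` is hypothetical. The exact ratio is about 8.8%, so the reported 8% looks like the value with the fraction dropped; that reading is an inference from the numbers, not something the report states.

```python
def duplication_percent(duplicated: int, cleaned: int) -> float:
    """Share of cleaned lines that belong to a duplicate, in percent."""
    return 100.0 * duplicated / cleaned


cleaned_lines = 680    # from the report
duplicated_lines = 60  # from the report

overall = duplication_percent(duplicated_lines, cleaned_lines)
print(f"{overall:.1f}%")  # 8.8%

# 5% of the cleaned code, the level the report suggests staying below.
five_percent_lines = cleaned_lines * 5 // 100
print(five_percent_lines)  # 34
```

With 680 cleaned lines, staying under the suggested 5% means fewer than 34 duplicated lines, compared with the current 60.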
system: 8% (60 lines)
Duplication per Extension
py: 8% (60 lines)
Duplication per Component (primary)
01-byoc/code: 8% (44 lines)
01-byoc/invoke_endpoint: 72% (8 lines)
02-increment-train/invoke_endpoint_a2i: 18% (8 lines)
02-increment-train: 0% (0 lines)
03-lambda-api/process_a2i_results: 0% (0 lines)
Longest Duplicates
The list of 2 longest duplicates.
Size  #    Folder                                  File                Lines
22    x 2  01-byoc/code                            dataset.py          31:67 (19%)
           01-byoc/code                            dataset.py          158:181 (19%)
8     x 2  01-byoc/invoke_endpoint                 lambda_function.py  20:28 (72%)
           02-increment-train/invoke_endpoint_a2i  lambda_function.py  28:36 (18%)