aws-samples / machine-learning-samples
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 1% duplication:
    • 1,459 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 18 duplicated lines
  • 1 duplicates
system1% (18 lines)
Duplication per Extension
py1% (18 lines)
Duplication per Component (primary)
k-fold-cross-validation7% (18 lines)
targeted-marketing-python0% (0 lines)
social-media0% (0 lines)
mobile-ios0% (0 lines)
mobile-android0% (0 lines)
cost-based-ml0% (0 lines)
targeted-marketing-scala0% (0 lines)
ml-tools-python0% (0 lines)
Longest Duplicates
The list of 1 longest duplicates.
See data for all 1 duplicate
Size#FoldersFilesLinesCode
9 x 2 k-fold-cross-validation
k-fold-cross-validation
build_folds.py
collect_perf.py
71:81 (16%)
107:116 (14%)
view