aws-samples / amazon-emr-with-juicefs
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 1% duplication:
    • 851 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 12 duplicated lines
  • 1 duplicates
system1% (12 lines)
Duplication per Extension
py4% (12 lines)
Duplication per Component (primary)
source/benchmark-sample4% (12 lines)
ROOT0% (0 lines)
source/lib0% (0 lines)
source/benchmark-sample/tpcds-gen/src/main/java/org/notmysock/tpcds0% (0 lines)
source0% (0 lines)
deployment/cdk-solution-helper0% (0 lines)
Longest Duplicates
The list of 1 longest duplicates.
See data for all 1 duplicate
Size#FoldersFilesLinesCode
6 x 2 source/benchmark-sample
source/benchmark-sample
emr-benchmark.py
run-query.py
66:73 (2%)
24:31 (8%)
view