awslabs / python-deequ
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
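The procedure described in the bullets above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the tool's actual implementation: the names `clean_lines` and `find_duplicates` are hypothetical, cleaning is reduced to dropping blank lines, `#` comments, and import statements, and every window of 6 cleaned lines is reported separately rather than being merged into longer blocks.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report


def clean_lines(source):
    """Return (original_line_number, stripped_text) pairs.

    Assumption: "cleaning" means dropping blank lines, '#' comments,
    and import statements. The real tool may clean differently.
    """
    kept = []
    for number, raw in enumerate(source.splitlines(), start=1):
        text = raw.strip()
        if not text or text.startswith("#"):
            continue
        if text.startswith(("import ", "from ")):
            continue
        kept.append((number, text))
    return kept


def find_duplicates(files, min_block=MIN_BLOCK):
    """Find windows of min_block cleaned lines that occur more than once.

    files: dict mapping file name -> source text.
    Returns a list of location lists; each location is
    (file_name, first_original_line, last_original_line).
    """
    seen = defaultdict(list)
    for name, source in files.items():
        cleaned = clean_lines(source)
        for start in range(len(cleaned) - min_block + 1):
            window = cleaned[start:start + min_block]
            key = tuple(text for _, text in window)
            seen[key].append((name, window[0][0], window[-1][0]))
    return [places for places in seen.values() if len(places) > 1]
```

Because line numbers are kept from the original file, a reported range can span more original lines than the duplicate's size when blank lines or comments sit inside the block.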
Duplication Overall
  • 3% duplication:
    • 1,566 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 62 duplicated lines
  • 6 duplicates
system: 3% (62 lines)
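The overall figure follows from the two counts listed above. A minimal sketch of the arithmetic, assuming the percentage is simply duplicated lines divided by cleaned lines (the helper name `duplication_ratio` is hypothetical):

```python
def duplication_ratio(duplicated_lines, cleaned_lines):
    """Share of cleaned lines that belong to a duplicated block."""
    return duplicated_lines / cleaned_lines


ratio = duplication_ratio(62, 1566)
print(f"{ratio:.2%}")   # about 3.96%
print(ratio < 0.05)     # below the 5% guideline
```

The report shows this value as 3%, which would be consistent with rounding down; the report does not say how it rounds.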
Duplication per Extension
py: 4% (62 lines)
Duplication per Component (primary)
pydeequ: 4% (62 lines)
ROOT: 0% (0 lines)
Longest Duplicates
The list of 6 longest duplicates.
Size  #  Folders           Files                                       Lines
8     2  pydeequ, pydeequ  profiles.py, verification.py                103:112 (3%), 152:164 (6%)
7     2  pydeequ, pydeequ  verification.py, verification.py            71:78 (6%), 138:144 (6%)
7     2  pydeequ, pydeequ  profiles.py, profiles.py                    387:393 (3%), 432:438 (3%)
6     2  pydeequ, pydeequ  analyzers.py, verification.py               81:86 (1%), 138:143 (5%)
6     2  pydeequ, pydeequ  anomaly_detection.py, anomaly_detection.py  40:47 (4%), 69:76 (4%)
6     2  pydeequ, pydeequ  analyzers.py, verification.py               81:86 (1%), 71:77 (5%)