amazon-research / relaxed-adaptive-projection
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 19% duplication:
    • 1,388 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 269 duplicated lines
  • 6 duplicates
system19% (269 lines)
Duplication per Extension
py19% (269 lines)
Duplication per Component (primary)
datasets76% (219 lines)
ROOT7% (50 lines)
relaxed_adaptive_projection0% (0 lines)
privacy_budget_tracking0% (0 lines)
Longest Duplicates
The list of 6 longest duplicates.
See data for all 6 duplicates...
Size#FoldersFilesLinesCode
73 x 2 datasets
datasets
loans.py
toy_binary.py
29:151 (82%)
29:146 (82%)
view
73 x 2 datasets
datasets
adult.py
loans.py
29:151 (78%)
29:151 (82%)
view
73 x 2 datasets
datasets
adult.py
toy_binary.py
29:151 (78%)
29:146 (82%)
view
9 x 2 ROOT
ROOT
statistickway.py
statistickway_threshold.py
35:46 (27%)
38:49 (21%)
view
8 x 2 ROOT
ROOT
main.py
main.py
382:389 (1%)
504:511 (1%)
view
8 x 2 ROOT
ROOT
statistickway.py
statistickway_threshold.py
9:21 (24%)
9:22 (19%)
view
Duplicated Units
The list of top 4 duplicated units.
See data for all 4 unit duplicates...
Size#FoldersFilesLinesCode
29 x 2 datasets
datasets
loans.py
adult.py
0:0 
0:0 
view
16 x 3 datasets
datasets
datasets
loans.py
toy_binary.py
adult.py
0:0 
0:0 
0:0 
view
15 x 3 datasets
datasets
datasets
loans.py
toy_binary.py
adult.py
0:0 
0:0 
0:0 
view
13 x 3 datasets
datasets
datasets
loans.py
toy_binary.py
adult.py
0:0 
0:0 
0:0 
view