awslabs / aws-customer-churn-pipeline
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 80% duplication:
    • 3,174 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 2,553 duplicated lines
  • 317 duplicates
system80% (2,553 lines)
Duplication per Extension
yaml86% (2,210 lines)
py62% (343 lines)
Duplication per Component (primary)
cfn86% (2,210 lines)
scripts62% (343 lines)
ROOT0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
See data for all 317 duplicates...
Size#FoldersFilesLinesCode
287 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
838:1174 (25%)
841:1177 (25%)
view
278 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
1:288 (25%)
1:288 (24%)
view
161 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
514:674 (14%)
515:675 (14%)
view
111 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
723:837 (10%)
724:838 (9%)
view
68 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
393:460 (6%)
392:459 (6%)
view
55 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
290:344 (4%)
290:344 (4%)
view
48 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
465:512 (4%)
466:513 (4%)
view
46 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
676:721 (4%)
677:722 (4%)
view
35 x 2 scripts
scripts
coxph_preprocessing.py
preprocessing.py
14:67 (23%)
16:68 (22%)
view
32 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
346:377 (2%)
346:377 (2%)
view
28 x 2 scripts
scripts
coxph_preprocessing.py
preprocessing.py
147:181 (18%)
156:189 (17%)
view
27 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
515:541 (2%)
432:458 (2%)
view
27 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
433:459 (2%)
516:542 (2%)
view
27 x 2 cfn
cfn
time_to_event_pipeline.yaml
time_to_event_pipeline.yaml
432:458 (2%)
516:542 (2%)
view
27 x 2 cfn
cfn
classification_pipeline.yaml
classification_pipeline.yaml
433:459 (2%)
515:541 (2%)
view
26 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
735:760 (2%)
892:917 (2%)
view
26 x 2 cfn
cfn
time_to_event_pipeline.yaml
time_to_event_pipeline.yaml
736:761 (2%)
892:917 (2%)
view
26 x 2 cfn
cfn
classification_pipeline.yaml
time_to_event_pipeline.yaml
889:914 (2%)
736:761 (2%)
view
26 x 2 cfn
cfn
classification_pipeline.yaml
classification_pipeline.yaml
735:760 (2%)
889:914 (2%)
view
23 x 2 scripts
scripts
coxph_preprocessing.py
preprocessing.py
198:227 (15%)
211:240 (14%)
view