awslabs / sockeye
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (<5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
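The procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the report tool's actual implementation: the function names `clean` and `find_duplicate_windows` are hypothetical, comments are recognized only as lines starting with `#`, imports only as lines starting with `import ` or `from `, and indentation is ignored when comparing lines.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold named in the report: 6 or more identical lines


def clean(source):
    """Drop empty lines, '#' comment lines, and import lines (a simplification)."""
    kept = []
    for raw in source.splitlines():
        line = raw.strip()  # assumption: indentation is ignored when comparing
        if not line or line.startswith("#"):
            continue
        if line.startswith("import ") or line.startswith("from "):
            continue
        kept.append(line)
    return kept


def find_duplicate_windows(files, min_block=MIN_BLOCK):
    """Map each window of `min_block` cleaned lines to the places it occurs.

    `files` maps a file name to its source text. Only windows that occur
    more than once are returned, as {window: [(file_name, start_index), ...]}.
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for i in range(len(lines) - min_block + 1):
            window = tuple(lines[i:i + min_block])
            seen[window].append((name, i))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}
```

A real tool would also merge overlapping windows into one longer duplicate and map cleaned-line indexes back to original line numbers; both steps are omitted here to keep the sketch short.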
Duplication Overall
  • 41% duplication:
    • 15,975 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 6,626 duplicated lines
  • 218 duplicates
system: 41% (6,626 lines)
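The overall figure follows directly from the two line counts above. A minimal check, where the helper name `duplication_percent` is hypothetical:

```python
cleaned_lines = 15_975     # lines of cleaned code reported above
duplicated_lines = 6_626   # duplicated lines reported above


def duplication_percent(duplicated, cleaned):
    """Share of cleaned lines that belong to a duplicate, in percent."""
    return 100.0 * duplicated / cleaned


overall = duplication_percent(duplicated_lines, cleaned_lines)
print(f"{overall:.1f}%")  # prints 41.5%, shown in the report as 41%
```

The per-extension and per-component figures below list the same 6,626 duplicated lines with slightly higher percentages (42% and 43%). Presumably they are computed against the smaller cleaned-line count of that extension or component; the report does not show those denominators.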
Duplication per Extension
py: 42% (6,626 lines)
Duplication per Component (primary)
sockeye: 43% (6,626 lines)
sockeye_contrib: 0% (0 lines)
sockeye_contrib/vistools/templates: 0% (0 lines)
sockeye_contrib/vistools: 0% (0 lines)
sockeye_contrib/benchmark: 0% (0 lines)
ROOT: 0% (0 lines)
Longest Duplicates
The 20 longest duplicates, out of 218 duplicates in total.
Size  #    Folders           Files                                            Lines
182   x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        1011:1328 (14%), 990:1307 (15%)
151   x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        218:449 (12%), 199:431 (12%)
74    x 2  sockeye, sockeye  train.py, train_pt.py                            514:605 (9%), 491:582 (10%)
57    x 2  sockeye, sockeye  inference.py, inference_pt.py                    713:807 (8%), 711:805 (9%)
57    x 2  sockeye, sockeye  inference.py, inference_pt.py                    109:206 (8%), 110:207 (9%)
53    x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        1640:1704 (4%), 1607:1671 (4%)
51    x 2  sockeye, sockeye  training.py, training_pt.py                      205:267 (8%), 197:259 (8%)
49    x 2  sockeye, sockeye  train.py, train_pt.py                            19:128 (6%), 19:125 (6%)
44    x 2  sockeye, sockeye  loss.py, loss_pt.py                              66:126 (32%), 64:125 (25%)
43    x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        41:127 (3%), 41:127 (3%)
43    x 2  sockeye, sockeye  inference.py, inference_pt.py                    924:988 (6%), 905:969 (7%)
42    x 2  sockeye, sockeye  train.py, train_pt.py                            357:403 (5%), 335:381 (5%)
39    x 2  sockeye, sockeye  prepare_data.py, prepare_data_pt.py              25:74 (48%), 25:74 (48%)
38    x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        1777:1826 (3%), 1744:1793 (3%)
38    x 2  sockeye, sockeye  training.py, training_pt.py                      848:898 (6%), 827:877 (6%)
37    x 2  sockeye, sockeye  train.py, train_pt.py                            857:906 (4%), 799:848 (5%)
35    x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        695:736 (2%), 676:717 (2%)
35    x 2  sockeye, sockeye  checkpoint_decoder.py, checkpoint_decoder_pt.py  168:206 (27%), 163:201 (26%)
35    x 2  sockeye, sockeye  data_io.py, data_io_pt.py                        761:803 (2%), 742:784 (2%)
35    x 2  sockeye, sockeye  train.py, train_pt.py                            311:353 (4%), 289:331 (4%)
Duplicated Units
The top 20 duplicated units, out of 59 unit duplicates in total.
Size  #    Folders           Files                          Lines
55    x 2  sockeye, sockeye  train.py, train_pt.py          0:0, 0:0
39    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
38    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
48    x 2  sockeye, sockeye  training_pt.py, training.py    0:0, 0:0
27    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
38    x 2  sockeye, sockeye  translate_pt.py, translate.py  0:0, 0:0
42    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
37    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
38    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
19    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
18    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
25    x 2  sockeye, sockeye  inference.py, inference_pt.py  0:0, 0:0
25    x 2  sockeye, sockeye  inference.py, inference_pt.py  0:0, 0:0
17    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
15    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
18    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
15    x 2  sockeye, sockeye  loss.py, loss_pt.py            0:0, 0:0
26    x 2  sockeye, sockeye  inference.py, inference_pt.py  0:0, 0:0
13    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0
20    x 2  sockeye, sockeye  data_io_pt.py, data_io.py      0:0, 0:0