amazon-research / meta-learning-the-difference
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 14% duplication:
    • 6,061 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 889 duplicated lines
  • 32 duplicates
system14% (889 lines)
Duplication per Extension
py15% (889 lines)
Duplication per Component (primary)
abstractive_summarization/src56% (572 lines)
low_rank_comparisons/src6% (119 lines)
dialogue_personalization/utils6% (96 lines)
dialogue_personalization/model8% (59 lines)
dialogue_personalization7% (22 lines)
abstractive_summarization/src/others3% (21 lines)
low_rank_comparisons/eval0% (0 lines)

Duplication Between Components (50+ lines)

G dialogue_personalization/model dialogue_personalization/model dialogue_personalization/utils dialogue_personalization/utils dialogue_personalization/model--dialogue_personalization/utils 111

Download: SVG DOT (open online Graphviz editor)

Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 20 longest duplicates.
See data for all 32 duplicates...
Size#FoldersFilesLinesCode
143 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
16:194 (71%)
16:194 (71%)
view
56 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
196:266 (28%)
196:266 (28%)
view
43 x 2 abstractive_summarization/src
abstractive_summarization/src
sdpt_pretraining.py
trainer.py
20:71 (37%)
11:62 (28%)
view
37 x 2 dialogue_personalization/utils
dialogue_personalization/utils
beam_omt.py
beam_ptr.py
241:287 (19%)
192:238 (20%)
view
25 x 2 dialogue_personalization/model
dialogue_personalization/utils
common_layer.py
beam_omt.py
540:573 (6%)
254:287 (13%)
view
25 x 2 dialogue_personalization/model
dialogue_personalization/utils
common_layer.py
beam_ptr.py
540:573 (6%)
205:238 (13%)
view
13 x 2 abstractive_summarization/src
abstractive_summarization/src
run.py
sdpt_pretraining.py
88:102 (10%)
95:109 (11%)
view
13 x 2 abstractive_summarization/src/others
low_rank_comparisons/src
recadam.py
optimizer.py
79:98 (21%)
84:103 (6%)
view
12 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
run.py
222:234 (6%)
106:117 (10%)
view
12 x 2 abstractive_summarization/src
abstractive_summarization/src
sdpt_pretraining.py
tapt_pretraining.py
113:124 (10%)
222:234 (6%)
view
12 x 2 abstractive_summarization/src
abstractive_summarization/src
run.py
sdpt_pretraining.py
106:117 (10%)
113:124 (10%)
view
12 x 2 abstractive_summarization/src
abstractive_summarization/src
run.py
sdpt_pretraining.py
71:83 (10%)
78:90 (10%)
view
12 x 2 dialogue_personalization/model
dialogue_personalization/utils
common_layer.py
beam_omt.py
593:604 (3%)
241:252 (6%)
view
12 x 2 abstractive_summarization/src
abstractive_summarization/src
run.py
tapt_pretraining.py
106:117 (10%)
222:234 (6%)
view
12 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
sdpt_pretraining.py
222:234 (6%)
113:124 (10%)
view
12 x 2 dialogue_personalization/model
dialogue_personalization/utils
common_layer.py
beam_ptr.py
593:604 (3%)
192:203 (6%)
view
11 x 2 dialogue_personalization/utils
dialogue_personalization/utils
data_reader.py
data_reader.py
430:442 (2%)
471:483 (2%)
view
10 x 2 dialogue_personalization
dialogue_personalization/model
generate_samples.py
common_layer.py
27:36 (20%)
633:644 (2%)
view
9 x 2 low_rank_comparisons/src
low_rank_comparisons/src
data_utils.py
data_utils.py
55:64 (4%)
126:135 (4%)
view
8 x 2 low_rank_comparisons/src
low_rank_comparisons/src
gpt2_beam.py
gpt2_ft.py
19:30 (3%)
14:23 (3%)
view
Duplicated Units
The list of top 8 duplicated units.
See data for all 8 unit duplicates...
Size#FoldersFilesLinesCode
59 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
0:0 
0:0 
view
34 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
0:0 
0:0 
view
25 x 3 dialogue_personalization/model
dialogue_personalization/utils
dialogue_personalization/utils
common_layer.py
beam_ptr.py
beam_omt.py
0:0 
0:0 
0:0 
view
19 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
0:0 
0:0 
view
11 x 3 dialogue_personalization/model
dialogue_personalization/utils
dialogue_personalization/utils
common_layer.py
beam_ptr.py
beam_omt.py
0:0 
0:0 
0:0 
view
10 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
0:0 
0:0 
view
7 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
0:0 
0:0 
view
7 x 2 abstractive_summarization/src
abstractive_summarization/src
dapt_pretraining.py
tapt_pretraining.py
0:0 
0:0 
view