awslabs / gap-text2sql
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 45% duplication:
    • 14,831 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 6,690 duplicated lines
  • 282 duplicates
system45% (6,690 lines)
Duplication per Extension
py45% (6,690 lines)
Duplication per Component (primary)
relogic/pretrainkit/models87% (3,454 lines)
rat-sql-gap/seq2struct/models34% (1,397 lines)
relogic/pretrainkit64% (873 lines)
relogic76% (412 lines)
relogic/pretrainkit/datasets20% (177 lines)
rat-sql-gap/seq2struct/datasets10% (158 lines)
relogic/pretrainkit/scorers60% (68 lines)
rat-sql-gap/seq2struct/commands14% (62 lines)
rat-sql-gap/seq2struct10% (41 lines)
rat-sql-gap/seq2struct/grammars6% (36 lines)
relogic/logickit/utils5% (12 lines)
rat-sql-gap/seq2struct/utils0% (0 lines)
rat-sql-gap/seq2struct/resources0% (0 lines)
rat-sql-gap0% (0 lines)
rat-sql-gap/configs/gap0% (0 lines)
rat-sql-gap/experiments/spider-configs0% (0 lines)
relogic/logickit0% (0 lines)
relogic/logickit/modules0% (0 lines)
relogic/logickit/base0% (0 lines)

Duplication Between Components (50+ lines)

G rat-sql-gap/seq2struct/models rat-sql-gap/seq2struct/models relogic/pretrainkit/models relogic/pretrainkit/models rat-sql-gap/seq2struct/models--relogic/pretrainkit/models 910 rat-sql-gap/seq2struct rat-sql-gap/seq2struct rat-sql-gap/seq2struct--rat-sql-gap/seq2struct/models 58

Download: SVG DOT (open online Graphviz editor)

Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 20 longest duplicates.
See data for all 282 duplicates...
Size#FoldersFilesLinesCode
226 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
809:1172 (32%)
809:1172 (32%)
view
196 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
636:894 (28%)
568:827 (24%)
view
155 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
418:632 (22%)
352:566 (19%)
view
122 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
33:252 (17%)
34:253 (15%)
view
105 x 2 rat-sql-gap/seq2struct/models/spider
relogic/pretrainkit/models/relationalsemparse
spider_enc_modules.py
relational_transformer.py
312:449 (20%)
414:553 (20%)
view
105 x 2 rat-sql-gap/seq2struct/models/spider
relogic/pretrainkit/models/relationalsemparse
spider_enc_modules.py
relational_transformer.py
585:700 (20%)
639:754 (20%)
view
103 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
572:716 (17%)
551:695 (17%)
view
98 x 2 rat-sql-gap/seq2struct/models/spider
rat-sql-gap/seq2struct/models/spider
spider_enc.py
spider_enc.py
866:985 (8%)
1409:1527 (8%)
view
95 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
207:427 (13%)
214:434 (13%)
view
76 x 2 rat-sql-gap/seq2struct/models
relogic/pretrainkit/models/relationalsemparse
transformer.py
relational_transformer.py
213:329 (42%)
225:345 (14%)
view
60 x 2 relogic
relogic
entity-to-text-train.py
sql-to-text-train.py
31:112 (37%)
31:112 (38%)
view
53 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
669:756 (7%)
668:755 (7%)
view
52 x 2 relogic
relogic
entity-to-text-train.py
sql-to-text-train.py
127:240 (32%)
123:236 (33%)
view
50 x 2 rat-sql-gap/seq2struct/models
relogic/pretrainkit/models/relationalsemparse
transformer.py
relational_transformer.py
105:171 (27%)
114:182 (9%)
view
49 x 2 rat-sql-gap/seq2struct/datasets
rat-sql-gap/seq2struct/datasets
spider.py
spider.py
56:112 (21%)
119:175 (21%)
view
46 x 2 relogic/pretrainkit/models/multid
relogic/pretrainkit/models
multid.py
sql_to_text.py
14:84 (93%)
15:84 (93%)
view
45 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
63:147 (7%)
59:143 (7%)
view
43 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
803:870 (7%)
777:832 (7%)
view
42 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
432:491 (6%)
437:496 (6%)
view
41 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
873:941 (6%)
835:903 (7%)
view
Duplicated Units
The list of top 20 duplicated units.
See data for all 60 unit duplicates...
Size#FoldersFilesLinesCode
34 x 2 relogic/pretrainkit/models
relogic/pretrainkit/models/multid
sql_to_text.py
multid.py
0:0 
0:0 
view
28 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
0:0 
0:0 
view
25 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
0:0 
0:0 
view
24 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
0:0 
0:0 
view
25 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
0:0 
0:0 
view
31 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
0:0 
0:0 
view
21 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
0:0 
0:0 
view
29 x 2 rat-sql-gap/seq2struct/models
relogic/pretrainkit/models/relationalsemparse
transformer.py
relational_transformer.py
0:0 
0:0 
view
16 x 2 rat-sql-gap/seq2struct/models/spider
rat-sql-gap/seq2struct/models/spider
spider_enc.py
spider_enc.py
0:0 
0:0 
view
16 x 2 rat-sql-gap/seq2struct/models
relogic/pretrainkit/models/relationalsemparse
transformer.py
relational_transformer.py
0:0 
0:0 
view
18 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
0:0 
0:0 
view
29 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relational_semparse.py
semparse.py
0:0 
0:0 
view
18 x 2 rat-sql-gap/seq2struct/models
relogic/pretrainkit/models/relationalsemparse
transformer.py
relational_transformer.py
0:0 
0:0 
view
14 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
0:0 
0:0 
view
13 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
0:0 
0:0 
view
13 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
0:0 
0:0 
view
12 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
0:0 
0:0 
view
12 x 3 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
relogic/pretrainkit/models/semparse
relational_semparse.py
modeling_bart_copy.py
semparse.py
0:0 
0:0 
0:0 
view
11 x 2 relogic/pretrainkit/models/relationalsemparse
relogic/pretrainkit/models/semparse
modeling_relational_bart.py
modeling_bart_copy.py
0:0 
0:0 
view
11 x 2 relogic/pretrainkit
relogic/pretrainkit
multitask_trainer.py
trainer.py
0:0 
0:0 
view