facebookresearch / curiosity
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim for as little duplicated code as possible (under 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
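The procedure described above (clean the code, then look for runs of 6 or more identical lines) can be sketched as follows. This is a minimal illustration, not the report tool's actual implementation: the function names `clean` and `find_duplicate_windows` are hypothetical, the cleaning step only handles Python-style `#` comments and import lines, and the search uses fixed-size windows rather than extending each match to its longest possible block.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold from the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Drop empty lines, '#' comments, and import lines (a simplified cleaning step)."""
    kept = []
    for line in source.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        if stripped.startswith(("import ", "from ")):
            continue
        kept.append(stripped)
    return kept


def find_duplicate_windows(lines: list[str], size: int = MIN_BLOCK) -> dict[tuple[str, ...], list[int]]:
    """Map each repeated window of `size` cleaned lines to the indexes where it starts."""
    seen = defaultdict(list)
    for start in range(len(lines) - size + 1):
        seen[tuple(lines[start:start + size])].append(start)
    # Keep only windows that occur more than once.
    return {window: starts for window, starts in seen.items() if len(starts) > 1}


# Usage: a 6-line block that appears twice, separated by one unrelated line.
block = ["a = 1", "b = 2", "c = 3", "d = 4", "e = 5", "f = 6"]
source = "\n".join(["import os", "# a comment", ""] + block + ["x = 0"] + block)
duplicates = find_duplicate_windows(clean(source))
print(duplicates[tuple(block)])  # [0, 7]
```

Hashing whole windows keeps the sketch linear in the number of cleaned lines. A fuller version would merge overlapping windows so that a 76-line duplicate is reported once rather than as 71 overlapping 6-line matches.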
Duplication Overall
  • 39% duplication:
    • 2,953 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 1,173 duplicated lines
  • 50 duplicates
  • system: 39% (1,173 lines)
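The overall figure follows directly from the two line counts above. The sketch below reproduces it and also works out how many duplicated lines the stated target of under 5% would allow for a codebase of this size. The helper names are hypothetical, and the use of `math.floor` is an assumption: the exact ratio is about 39.72%, so the report appears to truncate rather than round.

```python
import math


def duplication_percent(duplicated: int, cleaned: int) -> float:
    """Duplicated lines as a percentage of cleaned lines."""
    return 100 * duplicated / cleaned


def max_duplicated_lines(cleaned: int, target_percent: float = 5.0) -> int:
    """Largest duplicated-line count that stays strictly below the target percentage."""
    limit = cleaned * target_percent / 100
    # Step back one line when the limit is a whole number, so the result stays strictly below it.
    return math.ceil(limit) - 1


exact = duplication_percent(1173, 2953)  # figures from the report
print(f"{exact:.2f}%")                   # 39.72%
print(math.floor(exact))                 # 39, the value shown in the report
print(max_duplicated_lines(2953))        # 147
```

In other words, reaching the under-5% target would mean reducing the duplicated lines from 1,173 to at most 147, assuming the cleaned line count stayed the same.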
Duplication per Extension
  • py: 33% (830 lines)
  • jsonnet: 74% (307 lines)
  • yaml: 92% (36 lines)
Duplication per Component (primary)
  • curiosity: 33% (830 lines)
  • configs: 74% (307 lines)
  • ROOT: 57% (36 lines)
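The per-extension and per-component breakdowns are two ways of splitting the same 1,173 duplicated lines, so each should sum to the overall total. A quick consistency check, with all numbers copied from the report and a hypothetical helper name:

```python
def breakdown_matches_total(breakdown: dict[str, int], total: int) -> bool:
    """Return True when the duplicated-line counts in a breakdown add up to the total."""
    return sum(breakdown.values()) == total


TOTAL_DUPLICATED = 1173  # from "Duplication Overall"

per_extension = {"py": 830, "jsonnet": 307, "yaml": 36}
per_component = {"curiosity": 830, "configs": 307, "ROOT": 36}

print(breakdown_matches_total(per_extension, TOTAL_DUPLICATED))  # True
print(breakdown_matches_total(per_component, TOTAL_DUPLICATED))  # True
```

The percentages differ between the two views (for example, 92% for yaml and 57% for ROOT on the same 36 lines) because each is measured against a different base of cleaned lines.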
Longest Duplicates
The list of 20 longest duplicates.
Size | # | Folders | Files | Lines
76 | 2 | curiosity, curiosity | baseline_reader.py, baseline_reader.py | 226:308 (25%), 392:474 (25%)
61 | 2 | curiosity, curiosity | baseline_reader.py, baseline_reader.py | 147:224 (20%), 313:390 (20%)
54 | 2 | configs, configs | baseline_model.jsonnet, model.jsonnet | 1:58 (41%), 1:58 (41%)
50 | 2 | configs, configs | baseline_model.jsonnet, model.jsonnet | 86:135 (38%), 86:135 (38%)
31 | 2 | curiosity, curiosity | stats.py, stats.py | 159:201 (10%), 273:315 (10%)
26 | 2 | curiosity, curiosity | baseline_models.py, models.py | 267:308 (8%), 325:366 (7%)
19 | 2 | curiosity, curiosity | stats.py, stats.py | 113:135 (6%), 227:249 (6%)
19 | 2 | configs, configs | paraphrase_curiosity_seq2seq.jsonnet, paraphrase_wow_seq2seq.jsonnet | 24:42 (25%), 23:41 (24%)
18 | 2 | curiosity, curiosity | stats.py, stats.py | 204:223 (6%), 318:337 (6%)
14 | 2 | curiosity, curiosity | baseline_models.py, models.py | 333:349 (4%), 389:405 (4%)
14 | 2 | curiosity, curiosity | baseline_models.py, models.py | 34:51 (4%), 34:51 (4%)
13 | 2 | curiosity, curiosity | baseline_reader.py, reader.py | 192:206 (4%), 193:206 (4%)
13 | 2 | curiosity, curiosity | baseline_reader.py, reader.py | 358:372 (4%), 193:206 (4%)
12 | 2 | ROOT, ROOT | environment.yaml, environment_gpu.yaml | 8:19 (63%), 9:20 (60%)
11 | 2 | curiosity, curiosity | reader.py, reader.py | 170:189 (3%), 323:340 (3%)
11 | 2 | configs, configs | baseline_model.jsonnet, model.jsonnet | 73:83 (8%), 73:83 (8%)
11 | 2 | curiosity, curiosity | baseline_models.py, models.py | 64:74 (3%), 65:75 (3%)
10 | 2 | curiosity, curiosity | baseline_models.py, models.py | 76:86 (3%), 78:88 (2%)
10 | 2 | configs, configs | paraphrase_curiosity_seq2seq.jsonnet, paraphrase_wow_seq2seq.jsonnet | 61:70 (13%), 63:72 (12%)
9 | 2 | configs, configs | paraphrase_curiosity_seq2seq.jsonnet, paraphrase_wow_seq2seq.jsonnet | 52:60 (12%), 51:59 (11%)
Duplicated Units
The list of top 3 duplicated units.
Size | # | Folders | Files | Lines
16 | 2 | curiosity, curiosity | baseline_reader.py, baseline_reader.py | 0:0, 0:0
26 | 2 | curiosity, curiosity | baseline_reader.py, baseline_reader.py | 0:0, 0:0
6 | 2 | curiosity, curiosity | stats.py, stats.py | 0:0, 0:0