facebookresearch / simmc2
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (below 5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
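The procedure described in the bullets above can be illustrated with a minimal Python sketch. It is not the report tool's actual implementation: the function names `clean_lines` and `find_duplicate_windows` are hypothetical, and the cleaning rules (dropping blank lines, `#` comments, and import statements) are a simplified assumption about what "cleaning" means for Python files.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean_lines(source):
    """Drop empty lines, '#' comments, and import statements (a simplification)."""
    cleaned = []
    for raw in source.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith("import ") or line.startswith("from "):
            continue
        cleaned.append(line)
    return cleaned


def find_duplicate_windows(files, min_block=MIN_BLOCK):
    """Map each window of `min_block` cleaned lines to every place it occurs.

    `files` is a dict of {file name: source text}. Only windows that occur
    more than once are returned, as {window: [(file name, cleaned index), ...]}.
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean_lines(source)
        for i in range(len(lines) - min_block + 1):
            window = tuple(lines[i:i + min_block])
            seen[window].append((name, i))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}
```

A real tool would also merge overlapping windows into maximal blocks and map cleaned indices back to original line numbers; the sketch only shows the core matching step.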
Duplication Overall
  • 21% duplication:
    • 2,512 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 531 duplicated lines
  • 26 duplicates
system: 21% (531 lines)
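As a quick check of the arithmetic, the overall percentage follows from the two line counts reported above. The helper name `duplication_percentage` is illustrative only and not part of the report tooling.

```python
def duplication_percentage(duplicated_lines, cleaned_lines):
    """Share of cleaned lines that belong to a duplicated block, in percent."""
    return 100 * duplicated_lines / cleaned_lines


# Figures taken from the report: 531 duplicated lines out of 2,512 cleaned lines.
overall = duplication_percentage(531, 2512)
print(f"{overall:.1f}%")  # 21.1%, which the report rounds to 21%
```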
Duplication per Extension
py: 21% (531 lines)
Duplication per Component (primary)
mm_dst/gpt2_dst/scripts: 29% (445 lines)
utils: 17% (52 lines)
mm_dst/utils: 12% (28 lines)
disambiguate: 2% (6 lines)
mm_dst/gpt2_dst/utils: 0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
| Size | # | Folders | Files | Lines |
|------|---|---------|-------|-------|
| 112 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 30:194 (41%); 31:195 (54%) |
| 43 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 240:291 (15%); 213:264 (20%) |
| 19 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 216:235 (6%); 195:214 (9%) |
| 18 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 296:315 (6%); 270:289 (8%) |
| 16 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | evaluate.py; evaluate_response.py | 22:39 (76%); 43:60 (36%) |
| 13 | x 2 | model/utils; model/utils | disambiguator_evaluation.py; response_evaluation.py | 96:108 (20%); 127:139 (14%) |
| 10 | x 2 | model/mm_dst/utils; model/mm_dst/utils | evaluate_dst.py; evaluate_dst.py | 220:235 (4%); 240:250 (4%) |
| 8 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/utils | evaluate_response.py; evaluate_dst.py | 52:60 (18%); 379:387 (3%) |
| 8 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/utils | evaluate.py; evaluate_dst.py | 31:39 (38%); 379:387 (3%) |
| 8 | x 2 | model/utils; model/utils | response_evaluation.py; retrieval_evaluation.py | 120:127 (9%); 101:108 (11%) |
| 7 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 372:381 (2%); 293:301 (3%) |
| 6 | x 2 | model/mm_dst/gpt2_dst/scripts; model/utils | format_retrieval_results.py; retrieval_evaluation.py | 64:70 (13%); 108:113 (8%) |
| 6 | x 2 | model/utils; model/utils | response_evaluation.py; retrieval_evaluation.py | 134:139 (6%); 108:113 (8%) |
| 6 | x 2 | model/utils; model/utils | retrieval_evaluation.py; visualize_bboxes.py | 108:113 (8%); 126:131 (7%) |
| 6 | x 2 | model/mm_dst/gpt2_dst/scripts; model/utils | format_retrieval_results.py; visualize_bboxes.py | 64:70 (13%); 126:131 (7%) |
| 6 | x 2 | model/utils; model/utils | response_evaluation.py; visualize_bboxes.py | 134:139 (6%); 126:131 (7%) |
| 6 | x 2 | model/disambiguate; model/utils | format_disambiguation_data.py; disambiguator_evaluation.py | 80:86 (10%); 103:108 (9%) |
| 6 | x 2 | model/mm_dst/gpt2_dst/scripts; model/utils | format_retrieval_results.py; response_evaluation.py | 64:70 (13%); 134:139 (6%) |
| 6 | x 2 | model/disambiguate; model/mm_dst/gpt2_dst/scripts | format_disambiguation_data.py; format_retrieval_results.py | 80:86 (10%); 64:70 (13%) |
| 6 | x 2 | model/mm_dst/gpt2_dst/scripts; model/utils | format_retrieval_results.py; disambiguator_evaluation.py | 64:70 (13%); 103:108 (9%) |
Duplicated Units
The list of top 4 duplicated units.
| Size | # | Folders | Files | Lines |
|------|---|---------|-------|-------|
| 22 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 0:0; 0:0 |
| 23 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 0:0; 0:0 |
| 10 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 0:0; 0:0 |
| 7 | x 2 | model/mm_dst/gpt2_dst/scripts; model/mm_dst/gpt2_dst/scripts | run_generation.py; run_retrieval.py | 0:0; 0:0 |