facebookresearch / CCQA
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • Duplication is measured by finding places in the code where 6 or more lines are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim for as little duplicated code as possible (<5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
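The two steps described above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is a minimal illustration, not the report tool's actual implementation: the function names `clean_lines` and `find_duplicate_blocks` are hypothetical, the cleaning rules are assumed to be "drop blank lines, `#` comment lines, and import lines", and the sketch reports fixed-size windows rather than merging them into maximal duplicate blocks.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean_lines(source):
    """Return (original_line_number, stripped_text) pairs that survive cleaning.

    Assumption: cleaning drops blank lines, '#' comment lines, and import lines.
    """
    kept = []
    for number, raw in enumerate(source.splitlines(), start=1):
        text = raw.strip()
        if not text or text.startswith("#"):
            continue
        if text.startswith(("import ", "from ")):
            continue
        kept.append((number, text))
    return kept


def find_duplicate_blocks(sources, min_block=MIN_BLOCK):
    """Map each repeated window of `min_block` cleaned lines to its locations.

    `sources` maps a file name to its text. Each location is a tuple
    (file_name, first_original_line, last_original_line).
    """
    windows = defaultdict(list)
    for name, source in sources.items():
        cleaned = clean_lines(source)
        for start in range(len(cleaned) - min_block + 1):
            chunk = cleaned[start:start + min_block]
            key = tuple(text for _, text in chunk)
            windows[key].append((name, chunk[0][0], chunk[-1][0]))
    # Keep only windows that occur more than once.
    return {key: locs for key, locs in windows.items() if len(locs) > 1}
```

Because line numbers are taken from the original file while matching is done on cleaned lines, a reported range can span more original lines than the duplicate's cleaned size.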
Duplication Overall
  • 16% duplication:
    • 1,071 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 179 duplicated lines
  • 11 duplicates
system: 16% (179 lines)
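As a quick check of the headline figure, the percentage can be recomputed from the two line counts in the report and compared with the <5% target from the intro. This is only a sketch: the helper name `duplication_percent` is hypothetical, and the observation that the displayed 16% matches a truncated (not rounded) 16.7% is an inference from the numbers, not something the report states.

```python
CLEANED_LINES = 1071     # lines of cleaned code, from the report
DUPLICATED_LINES = 179   # duplicated lines, from the report
TARGET_PERCENT = 5.0     # recommended upper bound from the intro


def duplication_percent(duplicated, cleaned):
    """Return duplicated lines as a percentage of cleaned lines."""
    if cleaned <= 0:
        raise ValueError("cleaned line count must be positive")
    return 100.0 * duplicated / cleaned


percent = duplication_percent(DUPLICATED_LINES, CLEANED_LINES)
exceeds_target = percent >= TARGET_PERCENT

# The per-extension counts in the report (py: 161, rs: 18) add up to the total.
per_extension = {"py": 161, "rs": 18}
totals_match = sum(per_extension.values()) == DUPLICATED_LINES

print("{:.1f}% duplicated, exceeds target: {}".format(percent, exceeds_target))
```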
Duplication per Extension
py: 20% (161 lines)
rs: 6% (18 lines)
Duplication per Component (primary)
python: 20% (161 lines)
rust/src: 6% (18 lines)
rust: 0% (0 lines)
Longest Duplicates
The list of 11 longest duplicates.
Size  #    Folders   Files                            Lines
16    x 2  python    closed_book_processing.py        34:49 (23%)
           python    passage_retrieval_processing.py  212:229 (6%)
15    x 2  python    passage_retrieval_processing.py  98:112 (6%)
           python    passage_retrieval_processing.py  126:140 (6%)
12    x 2  python    closed_book_processing.py        73:84 (17%)
           python    passage_retrieval_processing.py  263:274 (5%)
9     x 2  rust/src  main.rs                          162:173 (3%)
           rust/src  main.rs                          241:251 (3%)
8     x 2  python    closed_book_processing.py        20:27 (11%)
           python    passage_retrieval_processing.py  23:30 (3%)
8     x 2  python    closed_book_processing.py        21:28 (11%)
           python    json_duplicate_filter.py         18:25 (5%)
7     x 2  python    passage_retrieval_processing.py  82:88 (2%)
           python    passage_retrieval_processing.py  116:122 (2%)
7     x 2  python    json_duplicate_filter.py         18:24 (5%)
           python    passage_retrieval_processing.py  24:30 (2%)
6     x 2  python    mhtml_to_json.py                 59:64 (1%)
           python    mhtml_to_json.py                 124:129 (1%)
6     x 2  python    mhtml_to_json.py                 68:73 (1%)
           python    mhtml_to_json.py                 133:138 (1%)
6     x 2  python    mhtml_to_json.py                 50:55 (1%)
           python    mhtml_to_json.py                 115:120 (1%)
Duplicated Units
The list of the top duplicated units (1 in total).
Size  #    Folders   Files    Lines
6     x 2  rust/src  main.rs  95:101
           rust/src  main.rs  103:109