facebookresearch / access
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
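The two steps described above (clean the code, then look for runs of 6 or more identical lines) can be sketched as follows. This is a minimal illustration, not the report tool's actual implementation: the function names `clean` and `find_duplicates` are hypothetical, the cleaning rules are simplified to blank lines, `#` comments, and import statements, and indentation is stripped before comparison, which is an assumption the report does not confirm.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source):
    """Return (original line number, stripped text) pairs for lines kept after cleaning.

    Simplified assumption: only blank lines, '#' comments, and import
    statements are dropped; indentation is ignored.
    """
    kept = []
    for number, line in enumerate(source.splitlines(), start=1):
        text = line.strip()
        if not text or text.startswith("#"):
            continue
        if text.startswith(("import ", "from ")):
            continue
        kept.append((number, text))
    return kept


def find_duplicates(files, min_block=MIN_BLOCK):
    """Group locations whose windows of `min_block` cleaned lines are identical.

    `files` maps a file name to its source text. Each location is a
    (file name, original line number of the window's first line) pair.
    """
    windows = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for i in range(len(lines) - min_block + 1):
            block = tuple(text for _, text in lines[i:i + min_block])
            windows[block].append((name, lines[i][0]))
    return [locations for locations in windows.values() if len(locations) > 1]


# Hypothetical input: two files sharing the same six statements.
shared = "\n".join(f"step_{i}()" for i in range(6))
files = {
    "a.py": "import os\n\n" + shared + "\n",
    "b.py": "# header\n" + shared + "\nextra()\n",
}
print(find_duplicates(files))  # [[('a.py', 3), ('b.py', 2)]]
```

A fixed-size sliding window reports a longer duplicate as several overlapping 6-line matches; a real tool would presumably merge those into one block, as the 10-line duplicate listed later in this report suggests.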
Duplication Overall
  • 2% duplication:
    • 1,391 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 32 duplicated lines
  • 2 duplicates
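The overall percentage follows from the two line counts above. The short sketch below reproduces the arithmetic; the helper name `duplication_percent` is hypothetical, and rounding to a whole percent is an assumption about how the report displays the value.

```python
def duplication_percent(duplicated_lines, cleaned_lines):
    """Share of cleaned lines that belong to a duplicate, as a percentage."""
    if cleaned_lines <= 0:
        raise ValueError("cleaned_lines must be positive")
    return 100.0 * duplicated_lines / cleaned_lines


overall = duplication_percent(32, 1391)  # counts taken from the report
print(f"{overall:.1f}%")                 # 2.3%
print(overall < 5)                       # True: below the 5% guideline
```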
system: 2% (32 lines)
Duplication per Extension
py: 2% (32 lines)
Duplication per Component (primary)
scripts: 28% (20 lines)
access/fairseq: 3% (12 lines)
access: 0% (0 lines)
access/resources: 0% (0 lines)
access/utils: 0% (0 lines)
access/evaluation: 0% (0 lines)
ROOT: 0% (0 lines)
Longest Duplicates
The 2 longest duplicates:
Size | # | Folders | Files | Lines
10 | x 2 | scripts, scripts | evaluate.py, generate.py | 17:26 (71%), 23:32 (55%)
6 | x 2 | access/fairseq, access/fairseq | base.py, base.py | 184:189 (2%), 262:267 (2%)