facebookresearch / MUSE
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 4% duplication:
    • 1,341 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 58 duplicated lines
  • 4 duplicates
system4% (58 lines)
Duplication per Extension
py4% (58 lines)
Duplication per Component (primary)
ROOT17% (34 lines)
src/evaluation4% (24 lines)
src0% (0 lines)
Longest Duplicates
The list of 4 longest duplicates.
See data for all 4 duplicates...
Size#FoldersFilesLinesCode
11 x 2 ROOT
ROOT
supervised.py
unsupervised.py
26:38 (19%)
27:38 (10%)
view
6 x 2 src/evaluation
src/evaluation
wordsim.py
wordsim.py
95:100 (3%)
219:224 (3%)
view
6 x 2 src/evaluation
src/evaluation
word_translation.py
word_translation.py
35:41 (6%)
78:84 (6%)
view
6 x 2 ROOT
ROOT
supervised.py
unsupervised.py
94:105 (10%)
168:179 (5%)
view