facebookresearch / SentEval
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 5% duplication:
    • 1,804 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 104 duplicated lines
  • 5 duplicates
system5% (104 lines)
Duplication per Extension
py6% (104 lines)
Duplication per Component (primary)
senteval6% (60 lines)
senteval/tools6% (44 lines)
ROOT0% (0 lines)
data/downstream0% (0 lines)
Longest Duplicates
The list of 5 longest duplicates.
See data for all 5 duplicates...
Size#FoldersFilesLinesCode
20 x 2 senteval
senteval
sick.py
sick.py
56:80 (12%)
160:184 (12%)
view
10 x 2 senteval/tools
senteval/tools
ranking.py
ranking.py
279:290 (3%)
319:330 (3%)
view
10 x 2 senteval
senteval
sick.py
sick.py
43:52 (6%)
148:157 (6%)
view
6 x 2 senteval/tools
senteval/tools
ranking.py
relatedness.py
194:199 (2%)
92:97 (6%)
view
6 x 2 senteval/tools
senteval/tools
classifier.py
relatedness.py
144:149 (4%)
124:129 (6%)
view