facebookresearch / fastText
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 3% duplication:
    • 6,347 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 200 duplicated lines
  • 20 duplicates
system3% (200 lines)
Duplication per Extension
cc1% (57 lines)
js9% (54 lines)
h5% (40 lines)
py2% (32 lines)
cpp24% (17 lines)
Duplication per Component (primary)
src2% (83 lines)
alignment6% (32 lines)
webassembly5% (24 lines)
website/core17% (18 lines)
scripts/kbcompletion24% (17 lines)
python/fasttext_module1% (14 lines)
website/pages7% (12 lines)
ROOT0% (0 lines)
python/benchmarks0% (0 lines)
website0% (0 lines)
website/static0% (0 lines)
crawl0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
See data for all 20 duplicates...
Size#FoldersFilesLinesCode
17 x 2 scripts/kbcompletion
src
eval.cpp
dictionary.cc
20:40 (24%)
210:231 (4%)
view
9 x 2 alignment
alignment
unsup_align.py
unsup_multialign.py
67:75 (11%)
156:164 (6%)
view
8 x 2 webassembly
webassembly
fasttext.js
fasttext.js
61:69 (4%)
499:507 (4%)
view
8 x 2 webassembly
webassembly
fasttext.js
fasttext.js
61:69 (4%)
81:89 (4%)
view
8 x 2 webassembly
webassembly
fasttext.js
fasttext.js
81:89 (4%)
499:507 (4%)
view
7 x 2 alignment
alignment
align.py
unsup_multialign.py
45:51 (8%)
39:45 (4%)
view
7 x 2 src
src
cc
productquantizer.cc
productquantizer.cc
183:190 (4%)
202:209 (4%)
view
7 x 2 src
src
loss.h
loss.h
77:83 (5%)
101:107 (5%)
view
7 x 2 python/fasttext_module/fasttext/pybind
python/fasttext_module/fasttext/pybind
cc
fasttext_pybind.cc
fasttext_pybind.cc
273:280 (1%)
444:451 (1%)
view
7 x 2 src
src
densematrix.h
quantmatrix.h
70:76 (14%)
51:57 (25%)
view
6 x 2 src
src
loss.h
loss.h
77:82 (4%)
137:142 (4%)
view
6 x 2 website/pages/en
website/pages/en
index.js
index.js
103:108 (3%)
134:139 (3%)
view
6 x 2 website/core
website/core
Footer.js
Footer.js
56:61 (5%)
66:71 (5%)
view
6 x 2 website/core
website/core
Footer.js
Footer.js
46:51 (5%)
56:61 (5%)
view
6 x 2 src
src
loss.h
loss.h
101:106 (4%)
137:142 (4%)
view
6 x 2 src
src
cc
main.cc
main.cc
304:309 (1%)
327:332 (1%)
view
6 x 2 src
src
loss.h
loss.h
137:142 (4%)
154:159 (4%)
view
6 x 2 src
src
loss.h
loss.h
77:82 (4%)
154:159 (4%)
view
6 x 2 website/core
website/core
Footer.js
Footer.js
46:51 (5%)
66:71 (5%)
view
6 x 2 src
src
loss.h
loss.h
101:106 (4%)
154:159 (4%)
view
Duplicated Units
The list of top 2 duplicated units.
See data for all 2 unit duplicates...
Size#FoldersFilesLinesCode
8 x 2 src
src
cc
main.cc
main.cc
50:58 
60:68 
view
6 x 2 alignment
alignment
align.py
unsup_multialign.py
0:0 
0:0 
view