facebookresearch / StarSpace
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplication as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
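The procedure described above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the function names `clean` and `duplicated_windows`, the list of skipped prefixes, and the fixed-size sliding window are all assumptions made for the example.

```python
from collections import defaultdict

MIN_LINES = 6  # threshold stated in the report


def clean(source: str) -> list[str]:
    """Drop blank lines, whole-line comments, and import-like lines.

    The prefixes below are an assumed approximation of "frequently
    duplicated constructs such as imports"; the real cleaning rules
    are not given in the report.
    """
    skipped_prefixes = ("//", "#include", "import ", "using ")
    cleaned = []
    for line in source.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith(skipped_prefixes):
            continue
        cleaned.append(stripped)
    return cleaned


def duplicated_windows(files: dict[str, str], min_lines: int = MIN_LINES):
    """Return every block of `min_lines` consecutive cleaned lines that
    occurs more than once, mapped to its (file name, start index) locations."""
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_lines + 1):
            window = tuple(lines[start:start + min_lines])
            seen[window].append((name, start))
    return {window: locs for window, locs in seen.items() if len(locs) > 1}
```

A longer duplicate shows up here as several overlapping windows; a fuller implementation would merge adjacent windows into one duplicate and report its total length.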
Duplication Overall
  • 29% duplication:
    • 3,968 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 1,167 duplicated lines
  • 191 duplicates
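As a quick check of the headline figure, the percentage follows directly from the two line counts above. The snippet below is only an illustrative calculation; the helper name `duplication_percent` and the 5% constant (taken from the guideline in the intro) are assumptions of this sketch.

```python
CLEANED_LINES = 3968      # lines of cleaned code reported above
DUPLICATED_LINES = 1167   # duplicated lines reported above
TARGET_PERCENT = 5.0      # guideline from the intro: keep duplication below 5%


def duplication_percent(duplicated: int, cleaned: int) -> float:
    """Share of cleaned lines that are part of at least one duplicate."""
    return 100.0 * duplicated / cleaned


percent = duplication_percent(DUPLICATED_LINES, CLEANED_LINES)
print(round(percent))            # 29
print(percent < TARGET_PERCENT)  # False
```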
system: 29% (1,167 lines)
Duplication per Extension
vcxproj: 86% (909 lines)
cpp: 8% (180 lines)
filters: 56% (78 lines)
Duplication per Component (primary)
MVS/query_nn: 94% (179 lines)
MVS/embed_doc: 94% (179 lines)
MVS/query_predict: 94% (179 lines)
MVS/print_ngrams: 94% (179 lines)
MVS/starspace: 93% (178 lines)
src: 7% (152 lines)
MVS: 38% (93 lines)
src/apps: 25% (28 lines)
python: 0% (0 lines)
src/utils: 0% (0 lines)

Duplication Between Components (50+ lines)

Duplicated lines shared by each pair of components:
  • MVS/embed_doc -- MVS/query_nn: 358
  • MVS/embed_doc -- MVS/print_ngrams: 358
  • MVS/embed_doc -- MVS/query_predict: 358
  • MVS/embed_doc -- MVS/starspace: 353
  • MVS/query_nn -- MVS/query_predict: 358
  • MVS/query_nn -- MVS/starspace: 353
  • MVS/print_ngrams -- MVS/query_nn: 358
  • MVS/print_ngrams -- MVS/query_predict: 358
  • MVS/print_ngrams -- MVS/starspace: 353
  • MVS/query_predict -- MVS/starspace: 353
  • MVS -- MVS/embed_doc: 144
  • MVS -- MVS/query_nn: 144
  • MVS -- MVS/print_ngrams: 144
  • MVS -- MVS/query_predict: 144
  • MVS -- MVS/starspace: 166

Longest Duplicates
The list of 20 longest duplicates.
Size | # | Folders | Files | Lines
144 | 2 | MVS/embed_doc, MVS/query_nn | embed_doc.vcxproj, query_nn.vcxproj | 33:221 (82%), 33:221 (82%)
144 | 2 | MVS/embed_doc, MVS/print_ngrams | embed_doc.vcxproj, print_ngrams.vcxproj | 33:221 (82%), 33:221 (82%)
144 | 2 | MVS/print_ngrams, MVS/query_predict | print_ngrams.vcxproj, query_predict.vcxproj | 33:221 (82%), 33:221 (82%)
144 | 2 | MVS/embed_doc, MVS/query_predict | embed_doc.vcxproj, query_predict.vcxproj | 33:221 (82%), 33:221 (82%)
144 | 2 | MVS/query_nn, MVS/query_predict | query_nn.vcxproj, query_predict.vcxproj | 33:221 (82%), 33:221 (82%)
144 | 2 | MVS/print_ngrams, MVS/query_nn | print_ngrams.vcxproj, query_nn.vcxproj | 33:221 (82%), 33:221 (82%)
75 | 2 | MVS/embed_doc, MVS/starspace | embed_doc.vcxproj, starspace.vcxproj | 33:132 (42%), 33:132 (42%)
75 | 2 | MVS/print_ngrams, MVS/starspace | print_ngrams.vcxproj, starspace.vcxproj | 33:132 (42%), 33:132 (42%)
75 | 2 | MVS/query_predict, MVS/starspace | query_predict.vcxproj, starspace.vcxproj | 33:132 (42%), 33:132 (42%)
75 | 2 | MVS/query_nn, MVS/starspace | query_nn.vcxproj, starspace.vcxproj | 33:132 (42%), 33:132 (42%)
43 | 2 | src, src | data.cpp, doc_data.cpp | 36:86 (22%), 27:78 (29%)
37 | 2 | MVS/query_predict, MVS/starspace | query_predict.vcxproj, starspace.vcxproj | 135:182 (21%), 134:181 (21%)
37 | 2 | MVS/query_nn, MVS/starspace | query_nn.vcxproj, starspace.vcxproj | 135:182 (21%), 134:181 (21%)
37 | 2 | MVS/embed_doc, MVS/starspace | embed_doc.vcxproj, starspace.vcxproj | 135:182 (21%), 134:181 (21%)
37 | 2 | MVS/print_ngrams, MVS/starspace | print_ngrams.vcxproj, starspace.vcxproj | 135:182 (21%), 134:181 (21%)
22 | 2 | MVS/query_nn, MVS/query_predict | query_nn.vcxproj, query_predict.vcxproj | 1:29 (12%), 1:29 (12%)
22 | 2 | MVS, MVS/query_predict | StarSpaceLib.vcxproj, query_predict.vcxproj | 1:29 (12%), 1:29 (12%)
22 | 2 | MVS/query_predict, MVS/starspace | query_predict.vcxproj, starspace.vcxproj | 1:29 (12%), 1:29 (12%)
22 | 2 | MVS/query_nn, MVS/starspace | query_nn.vcxproj, starspace.vcxproj | 1:29 (12%), 1:29 (12%)
22 | 2 | MVS/embed_doc, MVS/query_nn | embed_doc.vcxproj, query_nn.vcxproj | 1:29 (12%), 1:29 (12%)
Duplicated Units
The top duplicated unit (1 found).
Size | # | Folders | Files | Lines
50 | 2 | src, src | doc_data.cpp, data.cpp | 26:80, 35:88