facebookresearch / GENRE
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 21% duplication:
    • 4,018 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 878 duplicated lines
  • 140 duplicates
system21% (878 lines)
Duplication per Extension
py21% (878 lines)
Duplication per Component (primary)
scripts_mgenre29% (602 lines)
genre8% (143 lines)
scripts_genre39% (133 lines)
ROOT0% (0 lines)

Duplication Between Components (50+ lines)

G scripts_genre scripts_genre scripts_mgenre scripts_mgenre scripts_genre--scripts_mgenre 436

Download: SVG DOT (open online Graphviz editor)

Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 20 longest duplicates.
See data for all 140 duplicates...
Size#FoldersFilesLinesCode
26 x 2 scripts_mgenre
scripts_mgenre
preprocess_wikidata.py
preprocess_wikidata.py
92:121 (9%)
281:311 (9%)
view
21 x 2 scripts_mgenre
scripts_mgenre
preprocess_mgenre.py
preprocess_tries.py
90:113 (7%)
38:61 (12%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_anchors.py
preprocess_mention_dicts.py
56:77 (11%)
50:71 (17%)
view
20 x 2 scripts_genre
scripts_mgenre
convert_kilt_to_fairseq.py
preprocess_wikidata.py
52:73 (28%)
57:78 (7%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_mewsli.py
preprocess_wikinews.py
40:61 (20%)
38:59 (19%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_mention_dicts.py
preprocess_mgenre.py
50:71 (17%)
90:111 (7%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_anchors.py
preprocess_tries.py
56:77 (11%)
38:59 (11%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_mgenre.py
preprocess_wikidata.py
90:111 (7%)
57:78 (7%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_extract.py
preprocess_tries.py
34:55 (29%)
38:59 (11%)
view
20 x 2 scripts_genre
scripts_mgenre
convert_kilt_to_fairseq.py
preprocess_tries.py
52:73 (28%)
38:59 (11%)
view
20 x 2 scripts_genre
scripts_mgenre
convert_kilt_to_fairseq.py
preprocess_mention_dicts.py
52:73 (28%)
50:71 (17%)
view
20 x 2 scripts_genre
scripts_mgenre
convert_kilt_to_fairseq.py
preprocess_anchors.py
52:73 (28%)
56:77 (11%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_anchors.py
preprocess_wikidata.py
56:77 (11%)
57:78 (7%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_extract.py
preprocess_mention_dicts.py
34:55 (29%)
50:71 (17%)
view
20 x 2 scripts_genre
scripts_mgenre
convert_kilt_to_fairseq.py
preprocess_extract.py
52:73 (28%)
34:55 (29%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_mention_dicts.py
preprocess_tries.py
50:71 (17%)
38:59 (11%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_anchors.py
preprocess_extract.py
56:77 (11%)
34:55 (29%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_anchors.py
preprocess_mgenre.py
56:77 (11%)
90:111 (7%)
view
20 x 2 scripts_mgenre
scripts_mgenre
preprocess_mention_dicts.py
preprocess_wikidata.py
50:71 (17%)
57:78 (7%)
view
20 x 2 scripts_genre
scripts_mgenre
convert_kilt_to_fairseq.py
preprocess_mgenre.py
52:73 (28%)
90:111 (7%)
view
Duplicated Units
The list of top 2 duplicated units.
See data for all 2 unit duplicates...
Size#FoldersFilesLinesCode
9 x 2 genre
genre
entity_linking.py
entity_linking.py
0:0 
0:0 
view
6 x 2 genre
genre
utils.py
utils.py
0:0 
0:0 
view