facebookresearch / GENRE
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 21 files with 4,190 lines of code.
    • 1 very long files (1,096 lines of code)
    • 0 long files (0 lines of code)
    • 5 medium size files (1,575 lines of codeclsfd_ftr_w_mp_ins)
    • 8 small files (1,111 lines of code)
    • 7 very small files (408 lines of code)
26% | 0% | 37% | 26% | 9%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py26% | 0% | 37% | 26% | 9%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
genre65% | 0% | 14% | 17% | 2%
scripts_mgenre0% | 0% | 49% | 38% | 12%
scripts_genre0% | 0% | 79% | 0% | 20%
ROOT0% | 0% | 0% | 0% | 100%
Longest Files (Top 21)
File# lines# units
utils.py
in genre
1096 24
evaluate_kilt_dataset.py
in scripts_mgenre
499 2
evaluate_kilt_dataset.py
in scripts_genre
283 2
preprocess_wikidata.py
in scripts_mgenre
278 -
preprocess_mgenre.py
in scripts_mgenre
277 -
entity_linking.py
in genre
238 10
preprocess_anchors.py
in scripts_mgenre
181 1
preprocess_tries.py
in scripts_mgenre
180 -
trie.py
in genre
153 19
fairseq_model.py
in genre
134 5
preprocess_mention_dicts.py
in scripts_mgenre
126 -
preprocess_TR2016.py
in scripts_mgenre
115 -
preprocess_wikinews.py
in scripts_mgenre
114 -
preprocess_mewsli.py
in scripts_mgenre
108 -
evaluate_mel.py
in scripts_mgenre
96 1
preprocess_sentencepiece.py
in scripts_mgenre
96 4
convert_kilt_to_fairseq.py
in scripts_genre
75 1
preprocess_extract.py
in scripts_mgenre
74 -
hf_model.py
in genre
49 3
setup.py
in root
17 -
__init__.py
in genre
1 -
Files With Most Units (Top 11)
File# lines# units
utils.py
in genre
1096 24
trie.py
in genre
153 19
entity_linking.py
in genre
238 10
fairseq_model.py
in genre
134 5
preprocess_sentencepiece.py
in scripts_mgenre
96 4
hf_model.py
in genre
49 3
evaluate_kilt_dataset.py
in scripts_genre
283 2
evaluate_kilt_dataset.py
in scripts_mgenre
499 2
convert_kilt_to_fairseq.py
in scripts_genre
75 1
preprocess_anchors.py
in scripts_mgenre
181 1
evaluate_mel.py
in scripts_mgenre
96 1
Files With Long Lines (Top 3)

There are 3 files with lines longer than 120 characters. In total, there are 3 long lines.

File# lines# units# long lines
preprocess_tries.py
in scripts_mgenre
180 - 1
preprocess_mgenre.py
in scripts_mgenre
277 - 1
evaluate_kilt_dataset.py
in scripts_mgenre
499 2 1