facebookresearch / fastText
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how the lines of code are distributed across files of different sizes.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is a good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
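
The size categories listed above can be expressed as a small lookup. The following is a minimal sketch, not the report tool's own implementation; the names `BUCKETS` and `classify_file_size` are hypothetical and chosen only for illustration.

```python
# Upper bound (inclusive, in lines of code) and label for each category,
# taken from the category list in this report.
BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium size"),
    (1000, "long"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("lines_of_code must be at least 1")
    for upper_bound, label in BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    # Anything above the last bound falls into the 1001+ category.
    return "very long"


# Line counts taken from the "Longest Files" table of this report.
print(classify_file_size(729))  # fasttext.cc
print(classify_file_size(481))  # fasttext_pybind.cc
print(classify_file_size(27))   # matrix.h
```

Keeping the boundaries in one list makes it easy to adjust the thresholds without touching the classification logic.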
File Size Overall
  • There are 58 files with 7,545 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long file (729 lines of code)
    • 10 medium size files (3,588 lines of code)
    • 10 small files (1,427 lines of code)
    • 37 very small files (1,801 lines of code)
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 9% | 47% | 18% | 23%
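
The percentage breakdown above can be reproduced from the per-category line counts in the list. This is a minimal sketch under one assumption: the displayed figures appear to be rounded down to whole percentages, which is inferred from the numbers and not stated by the report. The names `loc_per_category` and `share_percent` are illustrative.

```python
# Lines of code per size category, as listed under "File Size Overall".
loc_per_category = {
    "1001+": 0,
    "501-1000": 729,
    "201-500": 3588,
    "101-200": 1427,
    "1-100": 1801,
}

total_loc = sum(loc_per_category.values())


def share_percent(loc: int, total: int) -> int:
    """Share of `loc` in `total` as a whole percentage, rounded down (assumed)."""
    return loc * 100 // total


shares = {
    category: share_percent(loc, total_loc)
    for category, loc in loc_per_category.items()
}

print(total_loc)  # 7545, matching the reported total
print(shares)     # {'1001+': 0, '501-1000': 9, '201-500': 47, '101-200': 18, '1-100': 23}
```

Because each share is rounded down, the displayed percentages add up to 97% instead of 100%.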


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
cc | 0% | 16% | 67% | 7% | 8%
py | 0% | 0% | 25% | 40% | 33%
js | 0% | 0% | 32% | 47% | 20%
h | 0% | 0% | 0% | 28% | 71%
cpp | 0% | 0% | 0% | 0% | 100%
pl | 0% | 0% | 0% | 0% | 100%
css | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
cfg | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
src | 0% | 16% | 50% | 12% | 20%
python/fasttext_module | 0% | 0% | 84% | 13% | 2%
webassembly | 0% | 0% | 100% | 0% | 0%
alignment | 0% | 0% | 0% | 56% | 43%
website/pages | 0% | 0% | 0% | 100% | 0%
ROOT | 0% | 0% | 0% | 37% | 62%
website/core | 0% | 0% | 0% | 100% | 0%
crawl | 0% | 0% | 0% | 0% | 100%
scripts/kbcompletion | 0% | 0% | 0% | 0% | 100%
website | 0% | 0% | 0% | 0% | 100%
website/static | 0% | 0% | 0% | 0% | 100%
python/benchmarks | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
fasttext.cc (in src) | 729 | 47
fasttext_pybind.cc (in python/fasttext_module/fasttext/pybind) | 481 | 4
dictionary.cc (in src) | 473 | 34
args.cc (in src) | 454 | 21
autotune.cc (in src) | 418 | 18
main.cc (in src) | 400 | 23
FastText.py (in python/fasttext_module/fasttext) | 351 | 45
loss.cc (in src) | 303 | 17
fasttext_wasm.cc (in webassembly) | 281 | 6
productquantizer.cc (in src) | 223 | 12
fasttext.js (in webassembly) | 204 | 25
index.js (in website/pages/en) | 190 | 3
meter.cc (in src) | 174 | 12
setup.py (in root) | 149 | 6
unsup_multialign.py (in alignment) | 147 | 6
densematrix.cc (in src) | 145 | 14
loss.h (in src) | 134 | -
util.py (in python/fasttext_module/fasttext/util) | 132 | 8
utils.py (in alignment) | 124 | 10
fasttext.h (in src) | 124 | 4
Footer.js (in website/core) | 108 | -
quantmatrix.cc (in src) | 95 | 9
align.py (in alignment) | 91 | 3
eval.cpp (in scripts/kbcompletion) | 90 | 2
dictionary.h (in src) | 90 | 1
siteConfig.js (in website) | 87 | -
unsup_align.py (in alignment) | 81 | 4
args.h (in src) | 79 | -
model.cc (in src) | 75 | 6
eval.py (in root) | 74 | 2
vector.cc (in src) | 72 | 9
meter.h (in src) | 71 | 5
autotune.h (in src) | 70 | -
filter_utf8.cc (in crawl) | 69 | 8
densematrix.h (in src) | 62 | 2
model.h (in src) | 58 | -
reduce_model.py (in root) | 57 | 4
utils.h (in src) | 48 | 2
wikifil.pl (in root) | 43 | -
vector.h (in src) | 43 | 1
productquantizer.h (in src) | 42 | -
dedup.cc (in crawl) | 41 | 2
runtests.py (in root) | 41 | 1
tabber.js (in website/static) | 39 | 7
quantmatrix.h (in src) | 39 | -
fasttext.css (in website/static) | 37 | -
get_word_vector.py (in python/benchmarks) | 36 | 1
eval.py (in alignment) | 36 | 1
utils.cc (in src) | 33 | 4
matrix.h (in src) | 27 | -
Files With Most Units (Top 20)
File | # lines | # units
fasttext.cc (in src) | 729 | 47
FastText.py (in python/fasttext_module/fasttext) | 351 | 45
dictionary.cc (in src) | 473 | 34
fasttext.js (in webassembly) | 204 | 25
main.cc (in src) | 400 | 23
args.cc (in src) | 454 | 21
autotune.cc (in src) | 418 | 18
loss.cc (in src) | 303 | 17
densematrix.cc (in src) | 145 | 14
productquantizer.cc (in src) | 223 | 12
meter.cc (in src) | 174 | 12
utils.py (in alignment) | 124 | 10
quantmatrix.cc (in src) | 95 | 9
vector.cc (in src) | 72 | 9
util.py (in python/fasttext_module/fasttext/util) | 132 | 8
filter_utf8.cc (in crawl) | 69 | 8
tabber.js (in website/static) | 39 | 7
setup.py (in root) | 149 | 6
fasttext_wasm.cc (in webassembly) | 281 | 6
unsup_multialign.py (in alignment) | 147 | 6
Files With Long Lines (Top 5)

There are 5 files with lines longer than 120 characters. In total, there are 10 long lines.
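
A long-line count like the one above can be sketched as follows. This is an illustration only: how the report tool measures line length (for example, how it treats tabs or trailing whitespace) is not stated here, so the plain character count used below is an assumption, and `count_long_lines` is a hypothetical helper name.

```python
LONG_LINE_THRESHOLD = 120  # characters, the threshold named in this report


def count_long_lines(text: str, threshold: int = LONG_LINE_THRESHOLD) -> int:
    """Count the lines in `text` that are longer than `threshold` characters."""
    return sum(1 for line in text.splitlines() if len(line) > threshold)


# A line of exactly 120 characters is not counted; one of 121 characters is.
sample = "a" * 120 + "\n" + "b" * 121 + "\nshort line\n"
print(count_long_lines(sample))  # 1
```

To apply it to a real file, read the file contents into a string and pass them to `count_long_lines`.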

File | # lines | # units | # long lines
FastText.py (in python/fasttext_module/fasttext) | 351 | 45 | 3
index.js (in website/pages/en) | 190 | 3 | 2
unsup_multialign.py (in alignment) | 147 | 6 | 2
eval.py (in alignment) | 36 | 1 | 2
utils.py (in alignment) | 124 | 10 | 1