facebookresearch / UnsupervisedMT
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 34 files with 4,623 lines of code.
    • 0 very long files (0 lines of code)
    • 2 long files (1,210 lines of code)
    • 6 medium size files (1,827 lines of codeclsfd_ftr_w_mp_ins)
    • 6 small files (848 lines of code)
    • 20 very small files (738 lines of code)
0% | 26% | 39% | 18% | 15%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 26% | 39% | 18% | 15%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
NMT/src0% | 37% | 31% | 25% | 5%
NMT/src/model0% | 38% | 43% | 0% | 17%
NMT/src/data0% | 0% | 54% | 45% | <1%
NMT0% | 0% | 90% | 0% | 9%
NMT/src/modules0% | 0% | 0% | 55% | 45%
PBSMT/src0% | 0% | 0% | 0% | 100%
PBSMT0% | 0% | 0% | 0% | 100%
Longest Files (Top 34)
File# lines# units
trainer.py
in NMT/src
617 31
attention.py
in NMT/src/model
593 11
transformer.py
in NMT/src/model
347 20
loader.py
in NMT/src/data
346 9
seq2seq.py
in NMT/src/model
336 7
main.py
in NMT
283 2
sequence_generator.py
in NMT/src
266 4
utils.py
in NMT/src
249 15
evaluator.py
in NMT/src
183 10
dataset.py
in NMT/src/data
167 16
multihead_attention.py
in NMT/src/modules
143 11
multiprocessing_event_loop.py
in NMT/src
128 13
dictionary.py
in NMT/src/data
117 10
test.py
in NMT/src
110 3
pretrain_embeddings.py
in NMT/src/model
98 4
create-phrase-table.py
in PBSMT
94 -
lm.py
in NMT/src/model
86 4
__init__.py
in NMT/src/model
68 2
sinusoidal_positional_embedding.py
in NMT/src/modules
54 4
utils.py
in PBSMT/src
46 3
loader.py
in PBSMT/src
43 4
logger.py
in NMT/src
36 3
layer_norm.py
in NMT/src/modules
35 2
dictionary.py
in PBSMT/src
32 8
preprocess.py
in NMT
30 -
adam_inverse_sqrt_with_warmup.py
in NMT/src
28 3
label_smoothed_cross_entropy.py
in NMT/src/modules
27 2
discriminator.py
in NMT/src/model
25 2
fairseq_utils.py
in NMT/src
17 3
gumbel.py
in NMT/src
15 3
__init__.py
in PBSMT/src
1 -
__init__.py
in NMT/src
1 -
__init__.py
in NMT/src/modules
1 -
__init__.py
in NMT/src/data
1 -
Files With Most Units (Top 20)
File# lines# units
trainer.py
in NMT/src
617 31
transformer.py
in NMT/src/model
347 20
dataset.py
in NMT/src/data
167 16
utils.py
in NMT/src
249 15
multiprocessing_event_loop.py
in NMT/src
128 13
attention.py
in NMT/src/model
593 11
multihead_attention.py
in NMT/src/modules
143 11
dictionary.py
in NMT/src/data
117 10
evaluator.py
in NMT/src
183 10
loader.py
in NMT/src/data
346 9
dictionary.py
in PBSMT/src
32 8
seq2seq.py
in NMT/src/model
336 7
loader.py
in PBSMT/src
43 4
sequence_generator.py
in NMT/src
266 4
pretrain_embeddings.py
in NMT/src/model
98 4
lm.py
in NMT/src/model
86 4
sinusoidal_positional_embedding.py
in NMT/src/modules
54 4
utils.py
in PBSMT/src
46 3
fairseq_utils.py
in NMT/src
17 3
test.py
in NMT/src
110 3
Files With Long Lines (Top 10)

There are 10 files with lines longer than 120 characters. In total, there are 27 long lines.

File# lines# units# long lines
attention.py
in NMT/src/model
593 11 6
loader.py
in NMT/src/data
346 9 6
main.py
in NMT
283 2 3
__init__.py
in NMT/src/model
68 2 3
test.py
in NMT/src
110 3 2
seq2seq.py
in NMT/src/model
336 7 2
trainer.py
in NMT/src
617 31 2
create-phrase-table.py
in PBSMT
94 - 1
pretrain_embeddings.py
in NMT/src/model
98 4 1
transformer.py
in NMT/src/model
347 20 1