facebookresearch / MultipleAttributeTextRewriting
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 49 files with 5,377 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long files (750 lines of code)
    • 7 medium size files (2,224 lines of codeclsfd_ftr_w_mp_ins)
    • 8 small files (1,153 lines of code)
    • 33 very small files (1,250 lines of code)
0% | 13% | 41% | 21% | 23%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 13% | 41% | 21% | 23%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
code/src0% | 37% | 48% | 6% | 7%
code/src/model0% | 0% | 70% | 9% | 20%
code0% | 0% | 51% | 0% | 48%
code/src/data0% | 0% | 0% | 99% | <1%
data/Amazon0% | 0% | 0% | 63% | 36%
data/Yelp0% | 0% | 0% | 60% | 39%
code/src/modules0% | 0% | 0% | 55% | 45%
data0% | 0% | 0% | 0% | 100%
code/scripts0% | 0% | 0% | 0% | 100%
Longest Files (Top 49)
File# lines# units
trainer.py
in code/src
750 54
evaluator.py
in code/src
477 19
attention.py
in code/src/model
398 12
transformer.py
in code/src/model
329 22
seq2seq.py
in code/src/model
274 9
sequence_generator.py
in code/src
266 4
main-parallel.py
in code
244 1
utils.py
in code/src
236 16
amazon_fader_process.py
in data/Amazon
177 -
loader.py
in code/src/data
174 7
yelp_fader_process.py
in data/Yelp
154 -
multihead_attention.py
in code/src/modules
143 11
dictionary.py
in code/src/data
143 10
multiprocessing_event_loop.py
in code/src
128 13
lm.py
in code/src/model
128 7
dataset.py
in code/src/data
106 9
train-lm.py
in code
92 -
train-classifier.py
in code
90 -
classifier.py
in code/src/model
78 4
pretrain_embeddings.py
in code/src/model
77 4
discriminator.py
in code/src/model
75 4
merge_yelp.py
in data/Yelp
67 -
merge_amazon.py
in data/Amazon
66 -
__init__.py
in code/src/model
63 2
create_human_test_set.py
in data
61 -
sinusoidal_positional_embedding.py
in code/src/modules
54 4
preprocess.py
in code
48 -
remove_unused_labels.py
in data
39 -
logger.py
in code/src
36 3
common_words.py
in code/scripts
35 1
layer_norm.py
in code/src/modules
35 2
attributes.py
in code/src
33 2
check_vocab.py
in code/scripts
32 2
shuffle_lines.py
in code/scripts
31 1
adam_inverse_sqrt_with_warmup.py
in code/src
28 3
test.py
in code/src
27 3
label_smoothed_cross_entropy.py
in code/src/modules
27 2
aes_utils.py
in data
24 5
common_lines.py
in code/scripts
22 1
extract_review.py
in data/Amazon
19 -
extract_review.py
in data/Yelp
19 -
fairseq_utils.py
in code/src
17 3
gumbel.py
in code/src
15 3
filter.py
in data/Amazon
15 -
filter.py
in data/Yelp
15 -
str2hash.py
in data
7 -
__init__.py
in code/src
1 -
__init__.py
in code/src/modules
1 -
__init__.py
in code/src/data
1 -
Files With Most Units (Top 20)
File# lines# units
trainer.py
in code/src
750 54
transformer.py
in code/src/model
329 22
evaluator.py
in code/src
477 19
utils.py
in code/src
236 16
multiprocessing_event_loop.py
in code/src
128 13
attention.py
in code/src/model
398 12
multihead_attention.py
in code/src/modules
143 11
dictionary.py
in code/src/data
143 10
seq2seq.py
in code/src/model
274 9
dataset.py
in code/src/data
106 9
lm.py
in code/src/model
128 7
loader.py
in code/src/data
174 7
aes_utils.py
in data
24 5
sequence_generator.py
in code/src
266 4
classifier.py
in code/src/model
78 4
pretrain_embeddings.py
in code/src/model
77 4
discriminator.py
in code/src/model
75 4
sinusoidal_positional_embedding.py
in code/src/modules
54 4
fairseq_utils.py
in code/src
17 3
test.py
in code/src
27 3
Files With Long Lines (Top 13)

There are 13 files with lines longer than 120 characters. In total, there are 39 long lines.

File# lines# units# long lines
evaluator.py
in code/src
477 19 15
attention.py
in code/src/model
398 12 5
main-parallel.py
in code
244 1 3
common_words.py
in code/scripts
35 1 3
trainer.py
in code/src
750 54 3
__init__.py
in code/src/model
63 2 2
loader.py
in code/src/data
174 7 2
train-classifier.py
in code
90 - 1
train-lm.py
in code
92 - 1
classifier.py
in code/src/model
78 4 1
transformer.py
in code/src/model
329 22 1
lm.py
in code/src/model
128 7 1
seq2seq.py
in code/src/model
274 9 1