facebookresearch / UnsupervisedDecomposition
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 85 files with 14,894 lines of code.
    • 0 very long files (0 lines of code)
    • 5 long files (3,566 lines of code)
    • 17 medium size files (6,237 lines of codeclsfd_ftr_w_mp_ins)
    • 22 small files (3,230 lines of code)
    • 41 very small files (1,861 lines of code)
0% | 23% | 41% | 21% | 12%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 24% | 42% | 20% | 12%
perl0% | 0% | 0% | 100% | 0%
in0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
pytorch-transformers/pytorch_transformers0% | 35% | 42% | 13% | 7%
XLM/src0% | 56% | 0% | 31% | 12%
XLM/src/evaluation0% | 0% | 70% | 29% | <1%
XLM/src/data0% | 0% | 73% | 25% | <1%
XLM/src/model0% | 0% | 63% | 17% | 18%
XLM/src/model/memory0% | 0% | 61% | 25% | 13%
pytorch-transformers0% | 0% | 36% | 48% | 14%
XLM0% | 0% | 47% | 26% | 25%
pytorch-transformers/pseudoalignment0% | 0% | 0% | 37% | 62%
XLM/tools0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
modeling_transfo_xl.py
in pytorch-transformers/pytorch_transformers
914 50
modeling_xlnet.py
in pytorch-transformers/pytorch_transformers
700 37
tokenization_xlm.py
in pytorch-transformers/pytorch_transformers
699 19
modeling_bert.py
in pytorch-transformers/pytorch_transformers
628 55
trainer.py
in XLM/src
625 28
evaluator.py
in XLM/src/evaluation
496 19
modeling_xlm.py
in pytorch-transformers/pytorch_transformers
495 29
modeling_utils.py
in pytorch-transformers/pytorch_transformers
495 39
tokenization_utils.py
in pytorch-transformers/pytorch_transformers
454 49
transformer.py
in XLM/src/model
448 22
memory.py
in XLM/src/model/memory
446 19
tokenization_transfo_xl.py
in pytorch-transformers/pytorch_transformers
416 33
modeling_distilbert.py
in pytorch-transformers/pytorch_transformers
402 30
modeling_gpt2.py
in pytorch-transformers/pytorch_transformers
394 29
modeling_openai.py
in pytorch-transformers/pytorch_transformers
389 29
add_umt_subqs_subas_to_q_squad_format_new.py
in pytorch-transformers
382 5
tokenization_bert.py
in pytorch-transformers/pytorch_transformers
279 23
modeling_transfo_xl_utilities.py
in pytorch-transformers/pytorch_transformers
245 7
dataset.py
in XLM/src/data
237 21
train.py
in XLM
225 2
loader.py
in XLM/src/data
224 7
glue.py
in XLM/src/evaluation
210 6
query.py
in XLM/src/model/memory
186 13
utils.py
in XLM/src
181 14
modeling_roberta.py
in pytorch-transformers/pytorch_transformers
174 15
optim.py
in XLM/src
173 10
file_utils.py
in pytorch-transformers/pytorch_transformers
171 9
pseudo_decomp_variable.py
in pytorch-transformers/pseudoalignment
170 5
ensemble_answers_by_confidence_script.py
in pytorch-transformers
162 3
dictionary.py
in XLM/src/data
161 11
tokenization_gpt2.py
in pytorch-transformers/pytorch_transformers
154 11
xnli.py
in XLM/src/evaluation
151 6
tokenization_openai.py
in pytorch-transformers/pytorch_transformers
147 10
pseudo_decomp_bert_nsp.py
in pytorch-transformers/pseudoalignment
145 1
tokenization_xlnet.py
in pytorch-transformers/pytorch_transformers
140 12
multi-bleu.perl
in XLM/src/evaluation
139 2
modeling_auto.py
in pytorch-transformers/pytorch_transformers
133 10
convert_roberta_checkpoint_to_pytorch.py
in pytorch-transformers/pytorch_transformers
129 1
translate.py
in XLM
126 2
__init__.py
in XLM/src/model
126 3
download_glue_data.py
in pytorch-transformers
122 5
convert_hotpot2squad_simple_script.py
in pytorch-transformers
122 2
__main__.py
in pytorch-transformers/pytorch_transformers
114 1
umt_gen_subqs_to_squad_format.py
in pytorch-transformers
104 -
embed_questions_with_bert.py
in pytorch-transformers/pseudoalignment
97 7
slurm.py
in XLM/src
97 4
utils.py
in XLM/src/model/memory
97 8
optimization.py
in pytorch-transformers/pytorch_transformers
94 11
convert_pytorch_checkpoint_to_tf.py
in pytorch-transformers/pytorch_transformers
81 2
pseudo_decomp_bert.py
in pytorch-transformers/pseudoalignment
81 1
Files With Most Units (Top 20)
File# lines# units
modeling_bert.py
in pytorch-transformers/pytorch_transformers
628 55
modeling_transfo_xl.py
in pytorch-transformers/pytorch_transformers
914 50
tokenization_utils.py
in pytorch-transformers/pytorch_transformers
454 49
modeling_utils.py
in pytorch-transformers/pytorch_transformers
495 39
modeling_xlnet.py
in pytorch-transformers/pytorch_transformers
700 37
tokenization_transfo_xl.py
in pytorch-transformers/pytorch_transformers
416 33
modeling_distilbert.py
in pytorch-transformers/pytorch_transformers
402 30
modeling_gpt2.py
in pytorch-transformers/pytorch_transformers
394 29
modeling_xlm.py
in pytorch-transformers/pytorch_transformers
495 29
modeling_openai.py
in pytorch-transformers/pytorch_transformers
389 29
trainer.py
in XLM/src
625 28
tokenization_bert.py
in pytorch-transformers/pytorch_transformers
279 23
transformer.py
in XLM/src/model
448 22
dataset.py
in XLM/src/data
237 21
tokenization_xlm.py
in pytorch-transformers/pytorch_transformers
699 19
memory.py
in XLM/src/model/memory
446 19
evaluator.py
in XLM/src/evaluation
496 19
modeling_roberta.py
in pytorch-transformers/pytorch_transformers
174 15
utils.py
in XLM/src
181 14
query.py
in XLM/src/model/memory
186 13
Files With Long Lines (Top 20)

There are 44 files with lines longer than 120 characters. In total, there are 210 long lines.

File# lines# units# long lines
tokenization_xlm.py
in pytorch-transformers/pytorch_transformers
699 19 17
download_glue_data.py
in pytorch-transformers
122 5 15
modeling_bert.py
in pytorch-transformers/pytorch_transformers
628 55 15
evaluator.py
in XLM/src/evaluation
496 19 14
ensemble_answers_by_confidence_script.py
in pytorch-transformers
162 3 9
modeling_utils.py
in pytorch-transformers/pytorch_transformers
495 39 8
tokenization_bert.py
in pytorch-transformers/pytorch_transformers
279 23 8
__main__.py
in pytorch-transformers/pytorch_transformers
114 1 8
modeling_distilbert.py
in pytorch-transformers/pytorch_transformers
402 30 7
transformer.py
in XLM/src/model
448 22 7
trainer.py
in XLM/src
625 28 7
add_umt_subqs_subas_to_q_squad_format_new.py
in pytorch-transformers
382 5 6
modeling_gpt2.py
in pytorch-transformers/pytorch_transformers
394 29 6
modeling_auto.py
in pytorch-transformers/pytorch_transformers
133 10 6
__init__.py
in XLM/src/model
126 3 6
convert_hotpot2squad_simple_script.py
in pytorch-transformers
122 2 5
tokenization_utils.py
in pytorch-transformers/pytorch_transformers
454 49 5
train.py
in XLM
225 2 5
memory.py
in XLM/src/model/memory
446 19 5
glue.py
in XLM/src/evaluation
210 6 5