facebookresearch / reconsider
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 41 files with 8,722 lines of code.
    • 0 very long files (0 lines of code)
    • 3 long files (2,260 lines of code)
    • 11 medium size files (4,067 lines of codeclsfd_ftr_w_mp_ins)
    • 11 small files (1,533 lines of code)
    • 16 very small files (862 lines of code)
0% | 25% | 46% | 17% | 9%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 25% | 46% | 17% | 9%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
pytorch_transformers0% | 29% | 45% | 16% | 8%
ROOT0% | 0% | 57% | 23% | 19%
Longest Files (Top 41)
File# lines# units
modeling_transfo_xl.py
in pytorch_transformers
916 51
modeling_xlnet.py
in pytorch_transformers
709 40
modeling_bert.py
in pytorch_transformers
635 58
modeling_utils.py
in pytorch_transformers
483 36
modeling_xlm.py
in pytorch_transformers
478 29
modeling_distilbert.py
in pytorch_transformers
458 32
tokenization_transfo_xl.py
in pytorch_transformers
415 33
modeling_gpt2.py
in pytorch_transformers
389 29
tokenization_utils.py
in pytorch_transformers
388 41
modeling_openai.py
in pytorch_transformers
387 30
main.py
in root
342 2
tokenization_bert.py
in pytorch_transformers
277 24
modeling_transfo_xl_utilities.py
in pytorch_transformers
245 7
prepro.py
in root
205 5
tokenization_xlm.py
in pytorch_transformers
181 12
file_utils.py
in pytorch_transformers
171 9
tokenization_gpt2.py
in pytorch_transformers
153 11
tokenization_openai.py
in pytorch_transformers
147 10
tokenization_roberta.py
in pytorch_transformers
142 11
tokenization_xlnet.py
in pytorch_transformers
140 12
modeling_roberta.py
in pytorch_transformers
136 13
convert_roberta_checkpoint_to_pytorch.py
in pytorch_transformers
130 1
__main__.py
in pytorch_transformers
114 1
DataLoader.py
in root
111 4
hotpot_evaluate_v1.py
in root
108 8
optimization.py
in pytorch_transformers
95 11
prepare_marked_dataset.py
in root
83 1
convert_pytorch_checkpoint_to_tf.py
in pytorch_transformers
81 2
convert_transfo_xl_checkpoint_to_pytorch.py
in pytorch_transformers
80 1
convert_xlnet_checkpoint_to_pytorch.py
in pytorch_transformers
73 1
modeling_auto.py
in pytorch_transformers
70 4
__init__.py
in pytorch_transformers
48 -
evaluate_qa.py
in root
45 2
convert_openai_checkpoint_to_pytorch.py
in pytorch_transformers
45 1
convert_gpt2_checkpoint_to_pytorch.py
in pytorch_transformers
45 1
convert_xlm_checkpoint_to_pytorch.py
in pytorch_transformers
42 1
modeling.py
in root
41 4
convert_tf_checkpoint_to_pytorch.py
in pytorch_transformers
37 1
tokenization_auto.py
in pytorch_transformers
35 2
tokenization_distilbert.py
in pytorch_transformers
25 -
download_reconsider_models.py
in root
17 -
Files With Most Units (Top 20)
File# lines# units
modeling_bert.py
in pytorch_transformers
635 58
modeling_transfo_xl.py
in pytorch_transformers
916 51
tokenization_utils.py
in pytorch_transformers
388 41
modeling_xlnet.py
in pytorch_transformers
709 40
modeling_utils.py
in pytorch_transformers
483 36
tokenization_transfo_xl.py
in pytorch_transformers
415 33
modeling_distilbert.py
in pytorch_transformers
458 32
modeling_openai.py
in pytorch_transformers
387 30
modeling_gpt2.py
in pytorch_transformers
389 29
modeling_xlm.py
in pytorch_transformers
478 29
tokenization_bert.py
in pytorch_transformers
277 24
modeling_roberta.py
in pytorch_transformers
136 13
tokenization_xlm.py
in pytorch_transformers
181 12
tokenization_xlnet.py
in pytorch_transformers
140 12
tokenization_gpt2.py
in pytorch_transformers
153 11
tokenization_roberta.py
in pytorch_transformers
142 11
optimization.py
in pytorch_transformers
95 11
tokenization_openai.py
in pytorch_transformers
147 10
file_utils.py
in pytorch_transformers
171 9
hotpot_evaluate_v1.py
in root
108 8
Files With Long Lines (Top 20)

There are 21 files with lines longer than 120 characters. In total, there are 91 long lines.

File# lines# units# long lines
modeling_bert.py
in pytorch_transformers
635 58 15
modeling_utils.py
in pytorch_transformers
483 36 8
tokenization_bert.py
in pytorch_transformers
277 24 8
__main__.py
in pytorch_transformers
114 1 8
download_reconsider_models.py
in root
17 - 8
modeling_distilbert.py
in pytorch_transformers
458 32 7
modeling_gpt2.py
in pytorch_transformers
389 29 6
main.py
in root
342 2 5
prepro.py
in root
205 5 4
tokenization_utils.py
in pytorch_transformers
388 41 4
modeling_openai.py
in pytorch_transformers
387 30 3
modeling_roberta.py
in pytorch_transformers
136 13 3
tokenization_xlm.py
in pytorch_transformers
181 12 2
modeling_xlm.py
in pytorch_transformers
478 29 2
tokenization_xlnet.py
in pytorch_transformers
140 12 2
evaluate_qa.py
in root
45 2 1
prepare_marked_dataset.py
in root
83 1 1
convert_roberta_checkpoint_to_pytorch.py
in pytorch_transformers
130 1 1
modeling_transfo_xl_utilities.py
in pytorch_transformers
245 7 1
tokenization_distilbert.py
in pytorch_transformers
25 - 1