facebookresearch / TaBERT
File Size

The distribution of file sizes, measured in lines of code.

Intro
  • File size measurements show how file sizes are distributed across the codebase.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
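The size categories listed above can be expressed as a small lookup. The following is a minimal sketch, not the report tool's actual implementation; the function name `size_category` and the label strings are assumptions chosen for illustration.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first four size categories listed above.
UPPER_BOUNDS = [100, 200, 500, 1000]
# Hypothetical labels, one per category, ordered from smallest to largest.
LABELS = ["very small", "small", "medium size", "long", "very long"]


def size_category(lines_of_code: int) -> str:
    """Return the size category for a file with the given lines of code."""
    if lines_of_code < 1:
        raise ValueError("a file needs at least one line of code to be classified")
    # bisect_left maps 100 -> index 0, 101 -> index 1, ..., 1001 -> index 4.
    return LABELS[bisect_left(UPPER_BOUNDS, lines_of_code)]
```

For example, `size_category(451)` falls in the 201-500 range and returns `"medium size"`.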
File Size Overall
  • There are 31 files with 5,706 lines of code.
    • 1 very long file (1,702 lines of code)
    • 0 long files (0 lines of code)
    • 8 medium size files (2,305 lines of code)
    • 8 small files (1,346 lines of code)
    • 14 very small files (353 lines of code)
Share of lines of code per size category:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
29% | 0% | 40% | 23% | 6%
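The percentages follow from the line counts in the list above. The sketch below recomputes them; how the report rounds is an assumption here. Truncating to whole percent reproduces the figures shown, whereas rounding to nearest would give 30% for the 1001+ category.

```python
# Lines of code per size category, copied from the list above.
loc_by_category = {
    "1001+": 1702,
    "501-1000": 0,
    "201-500": 2305,
    "101-200": 1346,
    "1-100": 353,
}

total = sum(loc_by_category.values())  # 5,706 lines of code overall

# Share of the total in percent, truncated to a whole number (assumed rounding).
shares = {name: 100 * loc // total for name, loc in loc_by_category.items()}

for name, share in shares.items():
    print(f"{name}: {share}%")
```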


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 30% | 0% | 40% | 23% | 4%
java | 0% | 0% | 0% | 0% | 100%
yml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
preprocess | 71% | 0% | 24% | 0% | 3%
table_bert | 0% | 0% | 63% | 34% | 2%
table_bert/vertical | 0% | 0% | 71% | 23% | 4%
ROOT | 0% | 0% | 95% | 0% | 4%
utils | 0% | 0% | 0% | 86% | 13%
contrib/wiki_extractor/src/main/java | 0% | 0% | 0% | 0% | 100%
scripts | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 31)
File | Location | # lines | # units
WikiExtractor.py | preprocess | 1702 | 88
dataset.py | table_bert | 451 | 30
htmltable.py | preprocess | 376 | 32
vertical_attention_table_bert.py | table_bert/vertical | 353 | 17
input_formatter.py | table_bert | 251 | 8
train.py | root | 236 | 2
vanilla_table_bert.py | table_bert | 218 | 8
dataset.py | table_bert/vertical | 218 | 8
extract_wiki_data.py | preprocess | 202 | 10
generate_vertical_tabert_training_data.py | utils | 193 | 3
input_formatter.py | table_bert/vertical | 186 | 6
table_bert.py | table_bert | 182 | 8
trainer.py | utils | 180 | 16
config.py | table_bert | 176 | 12
comm.py | utils | 155 | 10
generate_vanilla_tabert_training_data.py | utils | 140 | 2
table.py | table_bert | 134 | 16
table.py | preprocess | 54 | 3
MediaWikiToHtml.java | contrib/wiki_extractor/src/main/java | 50 | 2
evaluator.py | utils | 44 | 2
bin_to_h5.py | utils | 39 | 2
data_utils.py | preprocess | 38 | 4
config.py | table_bert/vertical | 37 | 3
utils.py | table_bert | 27 | -
env.yml | scripts | 26 | -
util.py | utils | 18 | 1
setup.py | root | 12 | -
__init__.py | table_bert | 5 | -
__init__.py | preprocess | 1 | -
__init__.py | utils | 1 | -
__init__.py | table_bert/vertical | 1 | -
Files With Most Units (Top 20)
File | Location | # lines | # units
WikiExtractor.py | preprocess | 1702 | 88
htmltable.py | preprocess | 376 | 32
dataset.py | table_bert | 451 | 30
vertical_attention_table_bert.py | table_bert/vertical | 353 | 17
trainer.py | utils | 180 | 16
table.py | table_bert | 134 | 16
config.py | table_bert | 176 | 12
extract_wiki_data.py | preprocess | 202 | 10
comm.py | utils | 155 | 10
table_bert.py | table_bert | 182 | 8
input_formatter.py | table_bert | 251 | 8
vanilla_table_bert.py | table_bert | 218 | 8
dataset.py | table_bert/vertical | 218 | 8
input_formatter.py | table_bert/vertical | 186 | 6
data_utils.py | preprocess | 38 | 4
table.py | preprocess | 54 | 3
generate_vertical_tabert_training_data.py | utils | 193 | 3
config.py | table_bert/vertical | 37 | 3
bin_to_h5.py | utils | 39 | 2
generate_vanilla_tabert_training_data.py | utils | 140 | 2
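The report does not define how units are counted. As a rough sketch, assuming a unit corresponds to a function or method definition in a Python file, a count could be obtained with the standard `ast` module. The function name `count_units` is hypothetical, and the report tool may count differently (for example, it may treat nested functions in another way).

```python
import ast


def count_units(source: str) -> int:
    """Count function and method definitions in Python source code.

    Assumption: one "unit" equals one def or async def, including nested ones.
    """
    tree = ast.parse(source)
    return sum(
        isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        for node in ast.walk(tree)
    )


sample = (
    "def load():\n"
    "    pass\n"
    "\n"
    "class Table:\n"
    "    def rows(self):\n"
    "        pass\n"
)
print(count_units(sample))  # one top-level function plus one method
```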
Files With Long Lines (Top 15)

There are 15 files with lines longer than 120 characters. In total, there are 32 long lines.

File | Location | # lines | # units | # long lines
vertical_attention_table_bert.py | table_bert/vertical | 353 | 17 | 7
dataset.py | table_bert/vertical | 218 | 8 | 7
train.py | root | 236 | 2 | 3
WikiExtractor.py | preprocess | 1702 | 88 | 2
dataset.py | table_bert | 451 | 30 | 2
input_formatter.py | table_bert/vertical | 186 | 6 | 2
extract_wiki_data.py | preprocess | 202 | 10 | 1
util.py | utils | 18 | 1 | 1
generate_vanilla_tabert_training_data.py | utils | 140 | 2 | 1
generate_vertical_tabert_training_data.py | utils | 193 | 3 | 1
MediaWikiToHtml.java | contrib/wiki_extractor/src/main/java | 50 | 2 | 1
env.yml | scripts | 26 | - | 1
config.py | table_bert | 176 | 12 | 1
table_bert.py | table_bert | 182 | 8 | 1
vanilla_table_bert.py | table_bert | 218 | 8 | 1
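A long-line count like the one in this section can be approximated with a few lines of code. This is a minimal sketch using the 120-character threshold stated above; the function name `count_long_lines` is hypothetical, and the report tool may measure line length differently (for example, in how it treats tabs or trailing whitespace).

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count lines whose length exceeds `limit` characters (newline excluded)."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


example = "a" * 120 + "\n" + "b" * 121 + "\nshort line\n"
print(count_long_lines(example))  # only the 121-character line exceeds the limit
```

A line of exactly 120 characters is not counted, since the section refers to lines longer than 120 characters.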