facebookresearch / anli
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how the lines of code are distributed across files of different sizes.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
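The classification described above can be sketched as a simple threshold lookup. This is a minimal illustration, not the report tool's implementation: the names `BUCKETS`, `size_category`, and `size_distribution` are hypothetical, and how the tool counts "lines of code" (for example, whether blank lines and comments are excluded) is not stated here.

```python
from collections import Counter

# Inclusive upper bound and label for each category, as listed in the report.
BUCKETS = [
    (100, "1-100"),
    (200, "101-200"),
    (500, "201-500"),
    (1000, "501-1000"),
]


def size_category(lines_of_code):
    """Return the size category label for a file with the given line count."""
    for upper_bound, label in BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    return "1001+"


def size_distribution(line_counts):
    """Count how many files fall into each size category."""
    return Counter(size_category(n) for n in line_counts)


# Line counts of the 25 files listed in this report.
REPORT_SIZES = [1052, 691, 686, 272, 112, 109, 101, 85, 71, 66, 65, 56,
                49, 48, 42, 41, 30, 6, 1, 1, 1, 1, 1, 1, 1]

print(dict(size_distribution(REPORT_SIZES)))
```

Applied to the line counts listed in this report, the sketch yields 1 very long, 2 long, 1 medium size, 3 small, and 18 very small files.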
File Size Overall
  • There are 25 files with 3,589 lines of code.
    • 1 very long file (1,052 lines of code)
    • 2 long files (1,377 lines of code)
    • 1 medium size file (272 lines of code)
    • 3 small files (322 lines of code)
    • 18 very small files (566 lines of code)
Share of lines of code per category:
1001+: 29% | 501-1000: 38% | 201-500: 7% | 101-200: 8% | 1-100: 15%
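Each percentage above is a category's share of the 3,589 total lines of code. The sketch below redoes that arithmetic from the counts in this report. Rounding down is an assumption: it reproduces the published figures, whereas rounding to nearest would give 8%, 9%, and 16% for the last three categories. The name `category_shares` is hypothetical.

```python
# Lines of code per size category, taken from the "File Size Overall" list.
LINES_PER_CATEGORY = {
    "1001+": 1052,
    "501-1000": 1377,
    "201-500": 272,
    "101-200": 322,
    "1-100": 566,
}


def category_shares(lines_per_category):
    """Return each category's share of total lines as a whole percentage.

    Uses integer floor division (assumption: the report truncates rather
    than rounds, which matches the figures it shows).
    """
    total = sum(lines_per_category.values())
    return {label: 100 * n // total for label, n in lines_per_category.items()}


print(category_shares(LINES_PER_CATEGORY))
# {'1001+': 29, '501-1000': 38, '201-500': 7, '101-200': 8, '1-100': 15}
```

Because each share is rounded down, the displayed percentages sum to 97% rather than 100%.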


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 29% | 38% | 7% | 8% | 15%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
src/modeling | 91% | 0% | 0% | 8% | <1%
src/nli | 0% | 82% | 0% | 13% | 3%
src/flint | 0% | 0% | 99% | 0% | <1%
src/utils | 0% | 0% | 0% | 0% | 100%
src/dataset_tools | 0% | 0% | 0% | 0% | 100%
src/flint/data_utils | 0% | 0% | 0% | 0% | 100%
src/hg_api | 0% | 0% | 0% | 0% | 100%
src | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 25)
File | Location | # lines | # units
dummy_modeling_xlnet.py | src/modeling | 1052 | 37
training.py | src/nli | 691 | 14
training_extra.py | src/nli | 686 | 14
torch_util.py | src/flint | 272 | 18
inference_debug.py | src/nli | 112 | 3
evaluation.py | src/nli | 109 | 1
res_encoder.py | src/modeling | 101 | 9
build_data.py | src/dataset_tools | 85 | 5
list_dict_data_tool.py | src/utils | 71 | 5
format_convert.py | src/dataset_tools | 66 | 3
inspection_tools.py | src/nli | 65 | 5
common.py | src/utils | 56 | 9
save_tool.py | src/utils | 49 | 5
interactive_eval.py | src/hg_api | 48 | 1
batchbuilder.py | src/flint/data_utils | 42 | 4
fields.py | src/flint/data_utils | 41 | 6
interactive.py | src/hg_api | 30 | 1
config.py | src | 6 | -
__init__.py | src/nli | 1 | -
__init__.py | src/utils | 1 | -
__init__.py | src/modeling | 1 | -
__init__.py | src/hg_api | 1 | -
__init__.py | src/dataset_tools | 1 | -
__init__.py | src/flint/data_utils | 1 | -
__init__.py | src/flint | 1 | -
Files With Most Units (Top 17)
File | Location | # lines | # units
dummy_modeling_xlnet.py | src/modeling | 1052 | 37
torch_util.py | src/flint | 272 | 18
training_extra.py | src/nli | 686 | 14
training.py | src/nli | 691 | 14
common.py | src/utils | 56 | 9
res_encoder.py | src/modeling | 101 | 9
fields.py | src/flint/data_utils | 41 | 6
inspection_tools.py | src/nli | 65 | 5
save_tool.py | src/utils | 49 | 5
list_dict_data_tool.py | src/utils | 71 | 5
build_data.py | src/dataset_tools | 85 | 5
batchbuilder.py | src/flint/data_utils | 42 | 4
inference_debug.py | src/nli | 112 | 3
format_convert.py | src/dataset_tools | 66 | 3
evaluation.py | src/nli | 109 | 1
interactive.py | src/hg_api | 30 | 1
interactive_eval.py | src/hg_api | 48 | 1
Files With Long Lines (Top 5)

There are 5 files with lines longer than 120 characters. In total, there are 6 long lines.

File | Location | # lines | # units | # long lines
training.py | src/nli | 691 | 14 | 2
training_extra.py | src/nli | 686 | 14 | 1
inference_debug.py | src/nli | 112 | 3 | 1
dummy_modeling_xlnet.py | src/modeling | 1052 | 37 | 1
batchbuilder.py | src/flint/data_utils | 42 | 4 | 1
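
A long-line count like the one in this section can be approximated with a short script. This is a sketch under stated assumptions: a line is "long" when it has more than 120 characters excluding the line terminator, and every `*.py` file under a root directory is scanned. The function names `count_long_lines` and `long_line_report` are hypothetical, and the report's tool may treat tabs or generated files differently.

```python
from pathlib import Path

LIMIT = 120  # threshold stated in this report


def count_long_lines(text, limit=LIMIT):
    """Count lines whose length (without the newline) exceeds the limit."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def long_line_report(root, pattern="*.py", limit=LIMIT):
    """Map each file under root that has long lines to its long-line count."""
    report = {}
    for path in sorted(Path(root).rglob(pattern)):
        # errors="replace" keeps the scan going on files with odd encodings.
        text = path.read_text(encoding="utf-8", errors="replace")
        n = count_long_lines(text, limit)
        if n:
            report[str(path)] = n
    return report
```

Calling `long_line_report("src")` from a checkout of the repository would list the files to compare against the table above; exact counts may differ if the repository has changed since the report was generated.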