facebookresearch / bio-lm
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 9 files with 2,543 lines of code.
    • 0 very long files (0 lines of code)
    • 3 long files (1,782 lines of code)
    • 2 medium size files (421 lines of codeclsfd_ftr_w_mp_ins)
    • 2 small files (297 lines of code)
    • 2 very small files (43 lines of code)
0% | 70% | 16% | 11% | 1%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 70% | 16% | 11% | 1%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
biolm0% | 93% | 0% | 6% | <1%
preprocessing0% | 0% | 66% | 27% | 6%
Longest Files (Top 9)
File# lines# units
utils_classification.py
in biolm
632 53
run_classification.py
in biolm
616 5
run_sequence_labelling.py
in biolm
534 5
preprocess_i2b2_2014_ner.py
in preprocessing
211 4
preprocess_i2b2_2012_ner.py
in preprocessing
210 4
preprocess_i2b2_2010_ner.py
in preprocessing
174 4
utils_sequence_labelling.py
in biolm
123 5
clean_conll_file.py
in preprocessing
42 1
__init__.py
in biolm
1 -
Files With Most Units (Top 8)
File# lines# units
utils_classification.py
in biolm
632 53
utils_sequence_labelling.py
in biolm
123 5
run_sequence_labelling.py
in biolm
534 5
run_classification.py
in biolm
616 5
preprocess_i2b2_2012_ner.py
in preprocessing
210 4
preprocess_i2b2_2010_ner.py
in preprocessing
174 4
preprocess_i2b2_2014_ner.py
in preprocessing
211 4
clean_conll_file.py
in preprocessing
42 1
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 5 long lines.

File# lines# units# long lines
run_classification.py
in biolm
616 5 2
utils_classification.py
in biolm
632 53 1
run_sequence_labelling.py
in biolm
534 5 1
preprocess_i2b2_2012_ner.py
in preprocessing
210 4 1