facebookresearch / nbref
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 31 files with 4,796 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long files (548 lines of code)
    • 7 medium size files (1,987 lines of codeclsfd_ftr_w_mp_ins)
    • 12 small files (1,790 lines of code)
    • 11 very small files (471 lines of code)
0% | 11% | 41% | 37% | 9%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 11% | 41% | 37% | 9%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
preprocess0% | 37% | 25% | 28% | 8%
baseline_model/data_utils0% | 0% | 48% | 38% | 12%
baseline_model/modules0% | 0% | 100% | 0% | 0%
baseline_model0% | 0% | 28% | 60% | 11%
preprocess/cram_vul_dataset0% | 0% | 0% | 100% | 0%
Longest Files (Top 31)
File# lines# units
asm_mips.py
in preprocess
548 19
train_tree_encoder.py
in baseline_model/data_utils
464 10
asm_obj.py
in preprocess
365 21
train_tree_encoder_v2.py
in baseline_model/data_utils
259 2
transformer_tree_model.py
in baseline_model/modules
235 16
encoder_decoder_layers.py
in baseline_model/modules
226 17
train_gnn.py
in baseline_model/data_utils
221 13
run_tree_transformer_multi_gpu.py
in baseline_model
217 6
train_vul.py
in baseline_model/data_utils
191 18
run_tree_transformer.py
in baseline_model
189 4
extract_obj.py
in preprocess
176 8
run_vulnerability_detection.py
in baseline_model
153 2
src2asm.py
in preprocess/cram_vul_dataset
152 1
data_utils.py
in baseline_model/data_utils
148 17
train.py
in baseline_model/data_utils
147 11
train_sim.py
in baseline_model/data_utils
140 11
ggnn_utils.py
in baseline_model/data_utils
130 5
sim_preprocess.py
in preprocess
124 9
config.py
in baseline_model
124 1
vul_preprocess.py
in preprocess
116 9
run_similarity_check.py
in baseline_model
90 1
dump_trace.py
in preprocess
77 1
Optim.py
in baseline_model/data_utils
77 13
dataset.py
in baseline_model/data_utils
67 6
Tree.py
in baseline_model/data_utils
65 7
split_dataset.py
in preprocess
52 3
gnn_dataset.py
in baseline_model/data_utils
36 3
Constants.py
in baseline_model/data_utils
4 -
__init__.py
in preprocess
1 -
__init__.py
in baseline_model/data_utils
1 -
__init__.py
in baseline_model
1 -
Files With Most Units (Top 20)
File# lines# units
asm_obj.py
in preprocess
365 21
asm_mips.py
in preprocess
548 19
train_vul.py
in baseline_model/data_utils
191 18
data_utils.py
in baseline_model/data_utils
148 17
encoder_decoder_layers.py
in baseline_model/modules
226 17
transformer_tree_model.py
in baseline_model/modules
235 16
train_gnn.py
in baseline_model/data_utils
221 13
Optim.py
in baseline_model/data_utils
77 13
train_sim.py
in baseline_model/data_utils
140 11
train.py
in baseline_model/data_utils
147 11
train_tree_encoder.py
in baseline_model/data_utils
464 10
vul_preprocess.py
in preprocess
116 9
sim_preprocess.py
in preprocess
124 9
extract_obj.py
in preprocess
176 8
Tree.py
in baseline_model/data_utils
65 7
dataset.py
in baseline_model/data_utils
67 6
run_tree_transformer_multi_gpu.py
in baseline_model
217 6
ggnn_utils.py
in baseline_model/data_utils
130 5
run_tree_transformer.py
in baseline_model
189 4
split_dataset.py
in preprocess
52 3
Files With Long Lines (Top 12)

There are 12 files with lines longer than 120 characters. In total, there are 60 long lines.

File# lines# units# long lines
train_tree_encoder.py
in baseline_model/data_utils
464 10 13
asm_obj.py
in preprocess
365 21 11
train_tree_encoder_v2.py
in baseline_model/data_utils
259 2 9
run_tree_transformer_multi_gpu.py
in baseline_model
217 6 8
run_tree_transformer.py
in baseline_model
189 4 5
extract_obj.py
in preprocess
176 8 4
src2asm.py
in preprocess/cram_vul_dataset
152 1 3
run_vulnerability_detection.py
in baseline_model
153 2 3
asm_mips.py
in preprocess
548 19 1
dump_trace.py
in preprocess
77 1 1
ggnn_utils.py
in baseline_model/data_utils
130 5 1
data_utils.py
in baseline_model/data_utils
148 17 1