amazon-research / panrep
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 21 files with 18,895 lines of code.
    • 2 very long files (12,398 lines of code)
    • 6 long files (4,114 lines of code)
    • 6 medium size files (1,886 lines of codeclsfd_ftr_w_mp_ins)
    • 2 small files (332 lines of code)
    • 5 very small files (165 lines of code)
65% | 21% | 9% | 1% | <1%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py65% | 21% | 9% | 1% | <1%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
panrep71% | 17% | 10% | 0% | <1%
data_handler/imdb0% | 75% | 0% | 24% | 0%
data_handler/cancer_network0% | 0% | 0% | 0% | 100%
data_handler0% | 0% | 0% | 0% | 100%
data_handler/drugbank0% | 0% | 0% | 0% | 100%
Longest Files (Top 21)
File# lines# units
plot_data.py
in panrep
10246 52
load_data.py
in panrep
2152 47
evaluation.py
in panrep
948 17
decoders.py
in panrep
907 68
panrep_mb_lg_scale_homo.py
in panrep
713 10
imdb_data_loader.py
in data_handler/imdb
517 10
imbd_data_loader.py
in data_handler/imdb
517 10
classifiers.py
in panrep
512 46
layers.py
in panrep
482 24
node_sampling_masking.py
in panrep
399 17
edge_masking_samling.py
in panrep
280 7
encoders.py
in panrep
277 20
utils.py
in panrep
237 15
model.py
in panrep
211 12
imdb_data_loader_xiang.py
in data_handler/imdb
190 6
imdb_data_to_graph.py
in data_handler/imdb
142 4
multimodal_cancer_network.py
in data_handler/cancer_network
77 -
multimodal_cancer_network.py
in data_handler
77 -
graph_supervision_tasks.py
in panrep
9 3
__init__.py
in panrep
1 -
__init__.py
in data_handler/drugbank
1 -
Files With Most Units (Top 17)
File# lines# units
decoders.py
in panrep
907 68
plot_data.py
in panrep
10246 52
load_data.py
in panrep
2152 47
classifiers.py
in panrep
512 46
layers.py
in panrep
482 24
encoders.py
in panrep
277 20
node_sampling_masking.py
in panrep
399 17
evaluation.py
in panrep
948 17
utils.py
in panrep
237 15
model.py
in panrep
211 12
panrep_mb_lg_scale_homo.py
in panrep
713 10
imdb_data_loader.py
in data_handler/imdb
517 10
imbd_data_loader.py
in data_handler/imdb
517 10
edge_masking_samling.py
in panrep
280 7
imdb_data_loader_xiang.py
in data_handler/imdb
190 6
imdb_data_to_graph.py
in data_handler/imdb
142 4
graph_supervision_tasks.py
in panrep
9 3
Files With Long Lines (Top 12)

There are 12 files with lines longer than 120 characters. In total, there are 413 long lines.

File# lines# units# long lines
plot_data.py
in panrep
10246 52 310
panrep_mb_lg_scale_homo.py
in panrep
713 10 40
load_data.py
in panrep
2152 47 29
evaluation.py
in panrep
948 17 13
decoders.py
in panrep
907 68 12
multimodal_cancer_network.py
in data_handler/cancer_network
77 - 2
multimodal_cancer_network.py
in data_handler
77 - 2
classifiers.py
in panrep
512 46 1
encoders.py
in panrep
277 20 1
node_sampling_masking.py
in panrep
399 17 1
model.py
in panrep
211 12 1
edge_masking_samling.py
in panrep
280 7 1