awslabs / sockeye
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show the distribution of file sizes.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such gargantuan proportions that it is hard to work with.
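The size categories listed above can be sketched as a small lookup. This is a minimal illustration, not the report tool's actual implementation; the function name `classify_file_size` and the constant names are hypothetical, and only the thresholds and labels come from the list above.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first four categories, in lines of code.
# Thresholds are taken from the category list above; names are illustrative.
UPPER_BOUNDS = [100, 200, 500, 1000]
LABELS = ["very small", "small", "medium size", "long", "very long"]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("lines_of_code must be at least 1")
    # bisect_left finds the first bound >= lines_of_code, so 100 maps to
    # index 0 (1-100) and 101 maps to index 1 (101-200). Anything above
    # 1000 falls off the end of the list and maps to "very long".
    return LABELS[bisect_left(UPPER_BOUNDS, lines_of_code)]
```

For example, `classify_file_size(1253)` places data_io.py (1253 lines) in the "very long" category, and `classify_file_size(102)` places setup.py in "small".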
File Size Overall
  • There are 66 files with 16,678 lines of code.
    • 3 very long files (3,626 lines of code)
    • 8 long files (5,393 lines of code)
    • 10 medium size files (3,461 lines of code)
    • 19 small files (2,812 lines of code)
    • 26 very small files (1,386 lines of code)
1001+: 21% | 501-1000: 32% | 201-500: 20% | 101-200: 16% | 1-100: 8%
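The percentage breakdown can be reproduced from the line counts listed above. The sketch below assumes the shares are truncated to whole percentages rather than rounded; the report does not state its rounding rule, but truncation matches the displayed figures (for instance, 3,461 / 16,678 is about 20.75%, shown as 20%). The variable and function names are hypothetical.

```python
# Lines of code per size category, as listed in "File Size Overall".
lines_per_category = {
    "1001+": 3626,
    "501-1000": 5393,
    "201-500": 3461,
    "101-200": 2812,
    "1-100": 1386,
}

total_lines = sum(lines_per_category.values())


def truncated_shares(counts: dict) -> dict:
    """Share of each category as a whole percentage, truncated (assumed rule)."""
    total = sum(counts.values())
    # Integer division drops the fractional part instead of rounding.
    return {label: value * 100 // total for label, value in counts.items()}


shares = truncated_shares(lines_per_category)
print(total_lines)
print(" | ".join(f"{pct}%" for pct in shares.values()))
```

Because each share is truncated, the displayed percentages need not add up to exactly 100%.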


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 22% | 32% | 21% | 17% | 7%
js | 0% | 0% | 0% | 0% | 100%
bib | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
css | 0% | 0% | 0% | 0% | 100%
cfg | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Logical decomposition: primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
sockeye | 22% | 33% | 21% | 14% | 6%
sockeye_contrib | 0% | 0% | 0% | 99% | <1%
ROOT | 0% | 0% | 0% | 52% | 47%
sockeye_contrib/vistools/templates | 0% | 0% | 0% | 0% | 100%
sockeye_contrib/vistools | 0% | 0% | 0% | 0% | 100%
sockeye_contrib/benchmark | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
data_io.py | sockeye | 1253 | 76
data_io_pt.py | sockeye | 1231 | 76
arguments.py | sockeye | 1142 | 45
train.py | sockeye | 828 | 18
train_pt.py | sockeye | 762 | 18
beam_search.py | sockeye | 651 | 43
inference.py | sockeye | 651 | 34
training.py | sockeye | 649 | 46
training_pt.py | sockeye | 626 | 39
inference_pt.py | sockeye | 621 | 34
beam_search_pt.py | sockeye | 605 | 40
utils.py | sockeye | 490 | 50
model_pt.py | sockeye | 448 | 34
layers_pt.py | sockeye | 444 | 57
model.py | sockeye | 412 | 30
layers.py | sockeye | 376 | 43
lexical_constraints.py | sockeye | 326 | 35
constants.py | sockeye | 264 | -
vocab.py | sockeye | 251 | 21
transformer_pt.py | sockeye | 235 | 16
transformer.py | sockeye | 215 | 14
plot_metrics.py | sockeye_contrib | 196 | 8
translate.py | sockeye | 190 | 5
translate_pt.py | sockeye | 185 | 5
loss_pt.py | sockeye | 180 | 31
encoder.py | sockeye | 170 | 19
decoder_pt.py | sockeye | 160 | 16
rouge.py | sockeye_contrib | 154 | 15
lexicon.py | sockeye | 151 | 10
checkpoint_decoder.py | sockeye | 146 | 4
checkpoint_decoder_pt.py | sockeye | 145 | 5
decoder.py | sockeye | 144 | 14
loss.py | sockeye | 144 | 28
encoder_pt.py | sockeye | 132 | 13
quantization.py | sockeye | 130 | 9
log.py | sockeye | 126 | 4
scoring_pt.py | sockeye | 120 | 5
output_handler.py | sockeye | 120 | 23
lr_scheduler.py | sockeye | 117 | 12
setup.py | root | 102 | 5
average.py | sockeye | 96 | 7
tree.js | sockeye_contrib/vistools/templates | 95 | 1
evaluate.py | sockeye | 94 | 8
scoring.py | sockeye | 91 | 5
prepare_data.py | sockeye | 90 | 2
prepare_data_pt.py | sockeye | 90 | 2
score.py | sockeye | 83 | 2
score_pt.py | sockeye | 79 | 2
generate_graphs.py | sockeye_contrib/vistools | 75 | 4
embeddings.py | sockeye | 75 | 4
Files With Most Units (Top 20)
File | Location | # lines | # units
data_io_pt.py | sockeye | 1231 | 76
data_io.py | sockeye | 1253 | 76
layers_pt.py | sockeye | 444 | 57
utils.py | sockeye | 490 | 50
training.py | sockeye | 649 | 46
arguments.py | sockeye | 1142 | 45
layers.py | sockeye | 376 | 43
beam_search.py | sockeye | 651 | 43
beam_search_pt.py | sockeye | 605 | 40
training_pt.py | sockeye | 626 | 39
lexical_constraints.py | sockeye | 326 | 35
inference.py | sockeye | 651 | 34
inference_pt.py | sockeye | 621 | 34
model_pt.py | sockeye | 448 | 34
loss_pt.py | sockeye | 180 | 31
model.py | sockeye | 412 | 30
loss.py | sockeye | 144 | 28
output_handler.py | sockeye | 120 | 23
vocab.py | sockeye | 251 | 21
encoder.py | sockeye | 170 | 19
Files With Long Lines (Top 19)

There are 19 files with lines longer than 120 characters. In total, there are 62 long lines.

File | Location | # lines | # units | # long lines
data_io.py | sockeye | 1253 | 76 | 13
arguments.py | sockeye | 1142 | 45 | 10
inference.py | sockeye | 651 | 34 | 5
training_pt.py | sockeye | 626 | 39 | 4
training.py | sockeye | 649 | 46 | 4
sockeye.bib | root | 53 | - | 3
vocab.py | sockeye | 251 | 21 | 3
beam_search.py | sockeye | 651 | 43 | 3
inference_pt.py | sockeye | 621 | 34 | 3
data_io_pt.py | sockeye | 1231 | 76 | 2
beam_search_pt.py | sockeye | 605 | 40 | 2
layers_pt.py | sockeye | 444 | 57 | 2
model.py | sockeye | 412 | 30 | 2
train.py | sockeye | 828 | 18 | 1
encoder_pt.py | sockeye | 132 | 13 | 1
optimizers.py | sockeye | 56 | 3 | 1
encoder.py | sockeye | 170 | 19 | 1
utils.py | sockeye | 490 | 50 | 1
translate.py | sockeye | 190 | 5 | 1
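
A long-line count like the one in this section can be approximated with a few lines of Python. This is a rough sketch under stated assumptions: the 120-character threshold comes from the text above, but how the report treats tabs, trailing whitespace, or line endings is not specified, so this version simply measures each line without its newline. The function name `count_long_lines` is hypothetical.

```python
def count_long_lines(text: str, max_length: int = 120) -> int:
    """Count lines strictly longer than max_length characters.

    Line terminators are excluded from the length (an assumption;
    the report does not say how it measures line length).
    """
    return sum(1 for line in text.splitlines() if len(line) > max_length)


# Small self-contained check: one 121-character line, one 120-character line.
sample = "a" * 121 + "\n" + "b" * 120 + "\n"
print(count_long_lines(sample))
```

To apply it to a real file, one could read the file contents with `pathlib.Path(path).read_text()` and pass the result to `count_long_lines`.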