amazon-research / meta-learning-the-difference
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 42 files with 6,482 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long files (609 lines of code)
    • 11 medium size files (3,053 lines of codeclsfd_ftr_w_mp_ins)
    • 11 small files (1,713 lines of code)
    • 19 very small files (1,107 lines of code)
0% | 9% | 47% | 26% | 17%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 9% | 48% | 24% | 17%
perl0% | 0% | 0% | 100% | 0%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
low_rank_comparisons/src0% | 30% | 35% | 15% | 18%
dialogue_personalization/model0% | 0% | 100% | 0% | 0%
dialogue_personalization/utils0% | 0% | 45% | 48% | 6%
abstractive_summarization/src0% | 0% | 38% | 37% | 23%
abstractive_summarization/src/others0% | 0% | 49% | 22% | 27%
low_rank_comparisons/eval0% | 0% | 100% | 0% | 0%
dialogue_personalization0% | 0% | 0% | 41% | 58%
Longest Files (Top 42)
File# lines# units
model.py
in low_rank_comparisons/src
609 36
data_reader.py
in dialogue_personalization/utils
449 29
common_layer.py
in dialogue_personalization/model
373 35
transformer.py
in dialogue_personalization/model
325 14
my_pyrouge.py
in abstractive_summarization/src/others
318 31
eval.py
in low_rank_comparisons/eval
264 9
gpt2_ft.py
in low_rank_comparisons/src
251 7
gpt2_beam.py
in low_rank_comparisons/src
237 7
dapt_pretraining.py
in abstractive_summarization/src
212 11
tapt_pretraining.py
in abstractive_summarization/src
212 11
optimizer.py
in low_rank_comparisons/src
208 19
metric.py
in dialogue_personalization/utils
204 16
data_utils.py
in low_rank_comparisons/src
198 17
beam_omt.py
in dialogue_personalization/utils
190 13
beam_ptr.py
in dialogue_personalization/utils
184 11
load_bert.py
in dialogue_personalization/utils
178 17
trainer.py
in abstractive_summarization/src
158 5
MAML.py
in dialogue_personalization
155 6
optimizer.py
in abstractive_summarization/src/others
141 5
multi-bleu.perl
in dialogue_personalization/utils
140 2
run.py
in abstractive_summarization/src
133 4
sdpt_pretraining.py
in abstractive_summarization/src
124 2
gpt2_decode.py
in low_rank_comparisons/src
112 2
preprocessing.py
in abstractive_summarization/src
100 9
encoder.py
in low_rank_comparisons/src
98 7
cal_rouge.py
in abstractive_summarization/src
97 4
utils.py
in abstractive_summarization/src/others
96 12
gpu.py
in low_rank_comparisons/src
93 6
config.py
in dialogue_personalization/utils
92 -
main_fine_tune.py
in dialogue_personalization
77 1
generate_samples.py
in dialogue_personalization
69 2
recadam.py
in abstractive_summarization/src/others
65 3
inference.py
in abstractive_summarization/src
62 1
gpt2_encode.py
in low_rank_comparisons/src
51 -
main.py
in dialogue_personalization
42 1
format_converting_webnlg.py
in low_rank_comparisons/src
41 -
interact.py
in dialogue_personalization
33 1
format_converting_dart.py
in low_rank_comparisons/src
31 -
exp_utils.py
in low_rank_comparisons/src
30 4
logging.py
in abstractive_summarization/src/others
16 1
format_converting_e2e.py
in low_rank_comparisons/src
13 -
__init__.py
in abstractive_summarization/src/others
1 -
Files With Most Units (Top 20)
File# lines# units
model.py
in low_rank_comparisons/src
609 36
common_layer.py
in dialogue_personalization/model
373 35
my_pyrouge.py
in abstractive_summarization/src/others
318 31
data_reader.py
in dialogue_personalization/utils
449 29
optimizer.py
in low_rank_comparisons/src
208 19
data_utils.py
in low_rank_comparisons/src
198 17
load_bert.py
in dialogue_personalization/utils
178 17
metric.py
in dialogue_personalization/utils
204 16
transformer.py
in dialogue_personalization/model
325 14
beam_omt.py
in dialogue_personalization/utils
190 13
utils.py
in abstractive_summarization/src/others
96 12
beam_ptr.py
in dialogue_personalization/utils
184 11
dapt_pretraining.py
in abstractive_summarization/src
212 11
tapt_pretraining.py
in abstractive_summarization/src
212 11
eval.py
in low_rank_comparisons/eval
264 9
preprocessing.py
in abstractive_summarization/src
100 9
gpt2_beam.py
in low_rank_comparisons/src
237 7
encoder.py
in low_rank_comparisons/src
98 7
gpt2_ft.py
in low_rank_comparisons/src
251 7
gpu.py
in low_rank_comparisons/src
93 6
Files With Long Lines (Top 20)

There are 28 files with lines longer than 120 characters. In total, there are 120 long lines.

File# lines# units# long lines
dapt_pretraining.py
in abstractive_summarization/src
212 11 11
tapt_pretraining.py
in abstractive_summarization/src
212 11 11
transformer.py
in dialogue_personalization/model
325 14 9
sdpt_pretraining.py
in abstractive_summarization/src
124 2 8
run.py
in abstractive_summarization/src
133 4 8
trainer.py
in abstractive_summarization/src
158 5 8
beam_omt.py
in dialogue_personalization/utils
190 13 7
gpt2_beam.py
in low_rank_comparisons/src
237 7 6
model.py
in low_rank_comparisons/src
609 36 6
MAML.py
in dialogue_personalization
155 6 6
main_fine_tune.py
in dialogue_personalization
77 1 5
eval.py
in low_rank_comparisons/eval
264 9 4
gpu.py
in low_rank_comparisons/src
93 6 4
recadam.py
in abstractive_summarization/src/others
65 3 4
optimizer.py
in low_rank_comparisons/src
208 19 3
gpt2_decode.py
in low_rank_comparisons/src
112 2 3
beam_ptr.py
in dialogue_personalization/utils
184 11 3
gpt2_ft.py
in low_rank_comparisons/src
251 7 2
data_reader.py
in dialogue_personalization/utils
449 29 2
load_bert.py
in dialogue_personalization/utils
178 17 2