awslabs / gap-text2sql
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 100 files with 15,464 lines of code.
    • 1 very long files (1,239 lines of code)
    • 11 long files (7,129 lines of code)
    • 10 medium size files (2,727 lines of codeclsfd_ftr_w_mp_ins)
    • 15 small files (2,296 lines of code)
    • 63 very small files (2,073 lines of code)
8% | 46% | 17% | 14% | 13%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py8% | 46% | 17% | 14% | 12%
jsonnet0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
rat-sql-gap/seq2struct/models29% | 27% | 16% | 17% | 8%
relogic/pretrainkit/models0% | 85% | 6% | 0% | 7%
relogic/pretrainkit0% | 87% | 0% | 11% | <1%
rat-sql-gap/seq2struct/datasets0% | 45% | 41% | 7% | 5%
rat-sql-gap/seq2struct/grammars0% | 99% | 0% | 0% | <1%
relogic/pretrainkit/datasets0% | 0% | 29% | 46% | 24%
relogic0% | 0% | 40% | 59% | 0%
relogic/logickit/utils0% | 0% | 95% | 0% | 4%
rat-sql-gap/seq2struct/commands0% | 0% | 43% | 41% | 14%
rat-sql-gap/seq2struct0% | 0% | 51% | 37% | 11%
rat-sql-gap/seq2struct/utils0% | 0% | 0% | 29% | 70%
rat-sql-gap/seq2struct/resources0% | 0% | 0% | 0% | 100%
relogic/pretrainkit/scorers0% | 0% | 0% | 0% | 100%
rat-sql-gap0% | 0% | 0% | 0% | 100%
rat-sql-gap/configs/gap0% | 0% | 0% | 0% | 100%
relogic/logickit/base0% | 0% | 0% | 0% | 100%
relogic/logickit/modules0% | 0% | 0% | 0% | 100%
rat-sql-gap/experiments/spider-configs0% | 0% | 0% | 0% | 100%
relogic/logickit0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
spider_enc.py
in rat-sql-gap/seq2struct/models/spider
1239 55
modeling_bart_copy.py
in relogic/pretrainkit/models/semparse
809 53
semparse.py
in relogic/pretrainkit/models/semparse
704 20
relational_semparse.py
in relogic/pretrainkit/models/relationalsemparse
697 20
modeling_relational_bart.py
in relogic/pretrainkit/models/relationalsemparse
694 41
evaluation.py
in rat-sql-gap/seq2struct/datasets/spider_lib
667 50
decoder.py
in rat-sql-gap/seq2struct/models/nl2code
666 32
multitask_trainer.py
in relogic/pretrainkit
636 33
trainer.py
in relogic/pretrainkit
613 31
spider.py
in rat-sql-gap/seq2struct/grammars
594 30
relational_transformer.py
in relogic/pretrainkit/models/relationalsemparse
527 26
spider_enc_modules.py
in rat-sql-gap/seq2struct/models/spider
522 22
process_sql.py
in rat-sql-gap/seq2struct/datasets/spider_lib
384 26
tree_traversal.py
in rat-sql-gap/seq2struct/models/nl2code
372 23
spider_beam_search.py
in rat-sql-gap/seq2struct/models/spider
334 4
logical_tabart.py
in relogic/pretrainkit/models/semparse
276 9
tabart.py
in relogic/pretrainkit/datasets/semparse
270 11
logical-tabart-pretraining.py
in relogic
235 6
spider.py
in rat-sql-gap/seq2struct/datasets
232 15
utils.py
in relogic/logickit/utils
212 24
infer.py
in rat-sql-gap/seq2struct/commands
208 9
ast_util.py
in rat-sql-gap/seq2struct
204 17
train.py
in rat-sql-gap/seq2struct/commands
200 8
transformer.py
in rat-sql-gap/seq2struct/models
186 20
variational_lstm.py
in rat-sql-gap/seq2struct/models
183 10
entity-to-text-train.py
in relogic
172 3
rat_text2sql.py
in relogic/pretrainkit/datasets/semparse
169 4
sql-to-text-train.py
in relogic
168 3
training_args.py
in relogic/pretrainkit
163 8
batched_sequence.py
in rat-sql-gap/seq2struct/utils
162 20
infer_tree_traversal.py
in rat-sql-gap/seq2struct/models/nl2code
158 7
column_inferring.py
in relogic/pretrainkit/datasets/semparse
158 8
optimizers.py
in rat-sql-gap/seq2struct
149 10
text2sql.py
in relogic/pretrainkit/datasets/semparse
109 5
enc_dec.py
in rat-sql-gap/seq2struct/models
108 16
spider_match_utils.py
in rat-sql-gap/seq2struct/models/spider
106 2
get_tables.py
in rat-sql-gap/seq2struct/datasets/spider_lib/preprocess
105 2
tabart.py
in relogic/pretrainkit/models/semparse
97 4
saver.py
in rat-sql-gap/seq2struct/utils
95 8
train_tree_traversal.py
in rat-sql-gap/seq2struct/models/nl2code
95 8
gap-bart.jsonnet
in rat-sql-gap/configs/gap
93 -
run.py
in rat-sql-gap
90 1
encoder.py
in rat-sql-gap/seq2struct/models/nl2code
88 12
pretrained_embeddings.py
in rat-sql-gap/seq2struct/resources
88 18
attention.py
in rat-sql-gap/seq2struct/models
87 11
multid.py
in relogic/pretrainkit/datasets/semparse
81 5
entity_to_text.py
in relogic/pretrainkit/datasets/text_generation
76 5
vocab.py
in rat-sql-gap/seq2struct/utils
74 18
text_generation.py
in relogic/pretrainkit/scorers
63 4
sql_to_text.py
in relogic/pretrainkit/datasets/text_generation
57 5
Files With Most Units (Top 20)
File# lines# units
spider_enc.py
in rat-sql-gap/seq2struct/models/spider
1239 55
modeling_bart_copy.py
in relogic/pretrainkit/models/semparse
809 53
evaluation.py
in rat-sql-gap/seq2struct/datasets/spider_lib
667 50
modeling_relational_bart.py
in relogic/pretrainkit/models/relationalsemparse
694 41
multitask_trainer.py
in relogic/pretrainkit
636 33
decoder.py
in rat-sql-gap/seq2struct/models/nl2code
666 32
trainer.py
in relogic/pretrainkit
613 31
spider.py
in rat-sql-gap/seq2struct/grammars
594 30
process_sql.py
in rat-sql-gap/seq2struct/datasets/spider_lib
384 26
relational_transformer.py
in relogic/pretrainkit/models/relationalsemparse
527 26
utils.py
in relogic/logickit/utils
212 24
tree_traversal.py
in rat-sql-gap/seq2struct/models/nl2code
372 23
spider_enc_modules.py
in rat-sql-gap/seq2struct/models/spider
522 22
batched_sequence.py
in rat-sql-gap/seq2struct/utils
162 20
transformer.py
in rat-sql-gap/seq2struct/models
186 20
relational_semparse.py
in relogic/pretrainkit/models/relationalsemparse
697 20
semparse.py
in relogic/pretrainkit/models/semparse
704 20
vocab.py
in rat-sql-gap/seq2struct/utils
74 18
pretrained_embeddings.py
in rat-sql-gap/seq2struct/resources
88 18
ast_util.py
in rat-sql-gap/seq2struct
204 17
Files With Long Lines (Top 20)

There are 26 files with lines longer than 120 characters. In total, there are 108 long lines.

File# lines# units# long lines
multitask_trainer.py
in relogic/pretrainkit
636 33 16
trainer.py
in relogic/pretrainkit
613 31 14
logical-tabart-pretraining.py
in relogic
235 6 12
relational_semparse.py
in relogic/pretrainkit/models/relationalsemparse
697 20 10
variational_lstm.py
in rat-sql-gap/seq2struct/models
183 10 9
semparse.py
in relogic/pretrainkit/models/semparse
704 20 7
optimizers.py
in rat-sql-gap/seq2struct
149 10 4
entity-to-text-train.py
in relogic
172 3 4
sql-to-text-train.py
in relogic
168 3 4
infer.py
in rat-sql-gap/seq2struct/commands
208 9 3
get_tables.py
in rat-sql-gap/seq2struct/datasets/spider_lib/preprocess
105 2 2
evaluation.py
in rat-sql-gap/seq2struct/datasets/spider_lib
667 50 2
spider_beam_search.py
in rat-sql-gap/seq2struct/models/spider
334 4 2
spider_enc_modules.py
in rat-sql-gap/seq2struct/models/spider
522 22 2
train.py
in rat-sql-gap/seq2struct/commands
200 8 2
sql_to_text.py
in relogic/pretrainkit/datasets/text_generation
57 5 2
tabart.py
in relogic/pretrainkit/datasets/semparse
270 11 2
multid.py
in relogic/pretrainkit/datasets/semparse
81 5 2
modeling_relational_bart.py
in relogic/pretrainkit/models/relationalsemparse
694 41 2
process_sql.py
in rat-sql-gap/seq2struct/datasets/spider_lib
384 26 1