facebookresearch / NeuralDB
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified into five categories based on their size in lines of code: 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), and 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
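The size categories listed above can be sketched as a small lookup. This is a minimal illustration, not the report tool's own code: the names `BOUNDS`, `LABELS`, and `classify` are hypothetical, and only the thresholds come from the list above.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first four categories (from the list above).
# Anything larger than the last bound falls into "very long".
BOUNDS = [100, 200, 500, 1000]
LABELS = ["very small", "small", "medium size", "long", "very long"]


def classify(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("a file must have at least one line of code")
    # bisect_left returns the index of the first bound >= lines_of_code,
    # so a value equal to a bound stays in the lower category.
    return LABELS[bisect_left(BOUNDS, lines_of_code)]


if __name__ == "__main__":
    for loc in (13, 129, 421, 525):
        print(loc, "->", classify(loc))
```

Using inclusive upper bounds keeps the boundary cases explicit: 100 lines is still "very small", while 101 lines is "small".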
File Size Overall
  • There are 73 files with 6,166 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long file (525 lines of code)
    • 6 medium size files (2,002 lines of code)
    • 18 small files (2,452 lines of code)
    • 48 very small files (1,187 lines of code)
Share of lines of code by file size category:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 8% | 32% | 39% | 19%
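The percentages above can be reproduced from the per-category line counts. The sketch below assumes each share is truncated to a whole percent rather than rounded; that is an inference from the numbers (2,452 of 6,166 lines is about 39.8% but is shown as 39%), not documented behavior of the report tool. The names `lines_per_category` and `share` are hypothetical.

```python
# Lines of code per size category, taken from the overall figures above.
lines_per_category = {
    "1001+": 0,
    "501-1000": 525,
    "201-500": 2002,
    "101-200": 2452,
    "1-100": 1187,
}

total = sum(lines_per_category.values())


def share(lines: int, total_lines: int) -> int:
    """Whole-percent share, truncated (assumption: the report does not round)."""
    return lines * 100 // total_lines


shares = {category: share(lines, total)
          for category, lines in lines_per_category.items()}

print(" | ".join(f"{pct}%" for pct in shares.values()))
```

Because of truncation, the displayed shares need not add up to 100%.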


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 8% | 32% | 39% | 19%
cfg | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Logical decomposition: primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
dataset-construction/src/ndb_data/generation | 0% | 43% | 22% | 25% | 8%
modelling/src/neuraldb | 0% | 0% | 74% | 12% | 13%
dataset-construction/src/ndb_data/construction | 0% | 0% | 51% | 36% | 11%
modelling/src/neuraldb/modelling | 0% | 0% | 99% | 0% | <1%
dataset-construction/src/ndb_data | 0% | 0% | 0% | 95% | 4%
modelling/src/neuraldb/dataset | 0% | 0% | 0% | 70% | 29%
ssg | 0% | 0% | 0% | 64% | 35%
modelling/src/neuraldb/evaluation | 0% | 0% | 0% | 80% | 19%
dataset-construction/src/ndb_data/data_import | 0% | 0% | 0% | 0% | 100%
modelling/src/neuraldb/retriever | 0% | 0% | 0% | 0% | 100%
dataset-construction/src/ndb_data/util | 0% | 0% | 0% | 0% | 100%
dataset-construction/src/ndb_data/wikidata_common | 0% | 0% | 0% | 0% | 100%
modelling | 0% | 0% | 0% | 0% | 100%
modelling/src/neuraldb/util | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
modelling/src | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
question_to_db.py | dataset-construction/src/ndb_data/generation | 525 | 14
run.py | modelling/src/neuraldb | 421 | 6
make_questions.py | dataset-construction/src/ndb_data/construction | 420 | 10
convert_spj_to_predictions.py | modelling/src/neuraldb | 366 | 12
neuraldb_trainer.py | modelling/src/neuraldb/modelling | 295 | 4
map_kelm.py | dataset-construction/src/ndb_data/generation | 278 | 11
final_scoring_with_dbsize_sweep.py | modelling/src/neuraldb | 222 | 1
instance_generator.py | modelling/src/neuraldb/dataset/instance_generator | 194 | 12
template_first_db.py | dataset-construction/src/ndb_data/generation | 187 | 2
final_scoring_with_dbsize.py | modelling/src/neuraldb | 170 | 1
make_database_finalize.py | dataset-construction/src/ndb_data/construction | 165 | 2
evaluate_set_ssg.py | ssg | 138 | 3
spj_generator.py | modelling/src/neuraldb/dataset/instance_generator | 134 | 3
sample_questions_100.py | dataset-construction/src/ndb_data | 129 | 1
sample_questions_250.py | dataset-construction/src/ndb_data | 129 | 1
sample_questions_500.py | dataset-construction/src/ndb_data | 129 | 1
sample_questions_1000.py | dataset-construction/src/ndb_data | 129 | 1
make_database_initial.py | dataset-construction/src/ndb_data/construction | 128 | 1
sample_questions_50.py | dataset-construction/src/ndb_data | 125 | 1
postprocess_baselines.py | modelling/src/neuraldb/evaluation | 122 | 1
sample_questions.py | dataset-construction/src/ndb_data | 122 | 1
data_collator_seq2seq.py | modelling/src/neuraldb/dataset | 120 | 1
finalize_hypothesis.py | dataset-construction/src/ndb_data/generation | 120 | 2
ssg_prediction.py | ssg | 110 | 1
postprocess_spj.py | modelling/src/neuraldb/evaluation | 101 | 1
make_database_initial_cache.py | dataset-construction/src/ndb_data/construction | 96 | 2
final_scoring.py | modelling/src/neuraldb | 74 | 1
train_ssg.py | ssg | 73 | 1
wikidata_index.py | dataset-construction/src/ndb_data/data_import | 70 | 3
build_json.py | dataset-construction/src/ndb_data/util | 65 | 3
ssg_utils.py | ssg | 62 | 3
dpr.py | modelling/src/neuraldb/retriever | 60 | 2
scoring_functions.py | modelling/src/neuraldb/evaluation | 52 | 9
dataset_statistics.py | modelling/src/neuraldb | 41 | 3
tfidf.py | modelling/src/neuraldb/retriever | 39 | 3
kelm_data.py | dataset-construction/src/ndb_data/data_import | 35 | -
convert_ssg_predictions.py | modelling/src/neuraldb | 34 | -
dataset_statistics.py | dataset-construction/src/ndb_data | 33 | 2
describe_dbs.py | dataset-construction/src/ndb_data/generation | 32 | 1
plot_db_sizes.py | dataset-construction/src/ndb_data/generation | 32 | -
neuraldb_parser.py | modelling/src/neuraldb/dataset | 31 | 7
convert_legacy_predictions.py | modelling/src/neuraldb | 30 | -
fix_sitelinks.py | dataset-construction/src/ndb_data/data_import | 30 | 1
setup.py | modelling | 28 | -
seq2seq_dataset.py | modelling/src/neuraldb/dataset | 26 | 4
neuraldb_file_reader.py | modelling/src/neuraldb/dataset | 24 | 2
externalir_generator_maxtok.py | modelling/src/neuraldb/dataset/instance_generator | 24 | 1
filter_db_facts.py | dataset-construction/src/ndb_data/generation | 23 | -
externalir_generator.py | modelling/src/neuraldb/dataset/instance_generator | 20 | 1
perfectir_generator.py | modelling/src/neuraldb/dataset/instance_generator | 20 | 1
Files With Most Units (Top 20)
File | Location | # lines | # units
question_to_db.py | dataset-construction/src/ndb_data/generation | 525 | 14
instance_generator.py | modelling/src/neuraldb/dataset/instance_generator | 194 | 12
convert_spj_to_predictions.py | modelling/src/neuraldb | 366 | 12
map_kelm.py | dataset-construction/src/ndb_data/generation | 278 | 11
make_questions.py | dataset-construction/src/ndb_data/construction | 420 | 10
scoring_functions.py | modelling/src/neuraldb/evaluation | 52 | 9
neuraldb_parser.py | modelling/src/neuraldb/dataset | 31 | 7
run.py | modelling/src/neuraldb | 421 | 6
neuraldb_trainer.py | modelling/src/neuraldb/modelling | 295 | 4
seq2seq_dataset.py | modelling/src/neuraldb/dataset | 26 | 4
wikidata.py | dataset-construction/src/ndb_data/wikidata_common | 13 | 4
spj_generator.py | modelling/src/neuraldb/dataset/instance_generator | 134 | 3
tfidf.py | modelling/src/neuraldb/retriever | 39 | 3
dataset_statistics.py | modelling/src/neuraldb | 41 | 3
build_json.py | dataset-construction/src/ndb_data/util | 65 | 3
wikidata_index.py | dataset-construction/src/ndb_data/data_import | 70 | 3
kelm.py | dataset-construction/src/ndb_data/wikidata_common | 13 | 3
ssg_utils.py | ssg | 62 | 3
evaluate_set_ssg.py | ssg | 138 | 3
neuraldb_file_reader.py | modelling/src/neuraldb/dataset | 24 | 2
Files With Long Lines (Top 0)

There are 0 files with lines longer than 120 characters. In total, there are 0 long lines.
