facebookresearch / dpr-scale
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 57 files with 3,107 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 3 medium size files (687 lines of codeclsfd_ftr_w_mp_ins)
    • 10 small files (1,447 lines of code)
    • 44 very small files (973 lines of code)
0% | 0% | 22% | 46% | 31%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 25% | 53% | 20%
yaml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
dpr_scale/task0% | 0% | 79% | 0% | 20%
dpr_scale/transforms0% | 0% | 79% | 0% | 20%
dpr_scale/datamodule0% | 0% | 99% | 0% | <1%
dpr_scale/utils0% | 0% | 0% | 77% | 22%
dpr_scale0% | 0% | 0% | 78% | 21%
dpr_scale/optim0% | 0% | 0% | 99% | <1%
dpr_scale/data_prep0% | 0% | 0% | 100% | 0%
dpr_scale/conf0% | 0% | 0% | 0% | 100%
dpr_scale/models0% | 0% | 0% | 0% | 100%
dpr_scale/conf/task0% | 0% | 0% | 0% | 100%
dpr_scale/conf/trainer0% | 0% | 0% | 0% | 100%
dpr_scale/conf/datamodule0% | 0% | 0% | 0% | 100%
dpr_scale/conf/checkpoint_callback0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
dpr_task.py
in dpr_scale/task
260 16
dpr_transform.py
in dpr_scale/transforms
220 5
dpr.py
in dpr_scale/datamodule
207 23
eval_dpr.py
in dpr_scale
180 20
prep_wiki_exp.py
in dpr_scale/utils
179 9
prep_ccnews.py
in dpr_scale/utils
169 13
tune_spar_weights.py
in dpr_scale/utils
160 5
msmarco_eval.py
in dpr_scale
155 11
run_retrieval_multiset.py
in dpr_scale
134 5
run_retrieval_fb.py
in dpr_scale
127 5
madgrad.py
in dpr_scale/optim
121 7
prep_wiki.py
in dpr_scale/utils
117 8
prep_conv_datasets.py
in dpr_scale/data_prep
105 9
run_retrieval.py
in dpr_scale
90 3
utils.py
in dpr_scale/utils
89 10
ccnews_stats.py
in dpr_scale/utils
87 6
dpr_eval_task.py
in dpr_scale/task
67 10
dstc7.yaml
in dpr_scale/conf
39 -
ubuntuv2.yaml
in dpr_scale/conf
37 -
convai2.yaml
in dpr_scale/conf
36 -
reddit.yaml
in dpr_scale/conf
36 -
orcas.yaml
in dpr_scale/conf
35 -
main.py
in dpr_scale
33 1
wiki_ict.yaml
in dpr_scale/conf
32 -
msmarco.yaml
in dpr_scale/conf
31 -
hf_model.py
in dpr_scale/models
30 2
ccnews_ict.yaml
in dpr_scale/conf
30 -
nq.yaml
in dpr_scale/conf
29 -
nq_roberta.yaml
in dpr_scale/conf
29 -
hf_bert.py
in dpr_scale/transforms
28 2
hf_transform.py
in dpr_scale/transforms
27 2
slurm.yaml
in dpr_scale/conf/trainer
25 -
config.py
in dpr_scale/conf
24 -
generate_query_embeddings.py
in dpr_scale
23 1
generate_embeddings.py
in dpr_scale
21 1
hf_bert.py
in dpr_scale/models
19 2
default.yaml
in dpr_scale/conf/datamodule
9 -
adamw.yaml
in dpr_scale/conf/task/optim
8 -
lamb.yaml
in dpr_scale/conf/task/optim
7 -
default.yaml
in dpr_scale/conf/checkpoint_callback
7 -
gpu_1_host.yaml
in dpr_scale/conf/trainer
7 -
dpr.yaml
in dpr_scale/conf/task
6 -
generate.yaml
in dpr_scale/conf/datamodule
5 -
nq_eval.yaml
in dpr_scale/conf
5 -
madgrad.yaml
in dpr_scale/conf/task/optim
5 -
hf_model.yaml
in dpr_scale/conf/task/model
4 -
hf_transform.yaml
in dpr_scale/conf/task/transform
3 -
__init__.py
in dpr_scale/datamodule
1 -
__init__.py
in dpr_scale/optim
1 -
__init__.py
in dpr_scale
1 -
Files With Most Units (Top 20)
File# lines# units
dpr.py
in dpr_scale/datamodule
207 23
eval_dpr.py
in dpr_scale
180 20
dpr_task.py
in dpr_scale/task
260 16
prep_ccnews.py
in dpr_scale/utils
169 13
msmarco_eval.py
in dpr_scale
155 11
utils.py
in dpr_scale/utils
89 10
dpr_eval_task.py
in dpr_scale/task
67 10
prep_wiki_exp.py
in dpr_scale/utils
179 9
prep_conv_datasets.py
in dpr_scale/data_prep
105 9
prep_wiki.py
in dpr_scale/utils
117 8
madgrad.py
in dpr_scale/optim
121 7
ccnews_stats.py
in dpr_scale/utils
87 6
run_retrieval_multiset.py
in dpr_scale
134 5
tune_spar_weights.py
in dpr_scale/utils
160 5
run_retrieval_fb.py
in dpr_scale
127 5
dpr_transform.py
in dpr_scale/transforms
220 5
run_retrieval.py
in dpr_scale
90 3
hf_model.py
in dpr_scale/models
30 2
hf_bert.py
in dpr_scale/models
19 2
hf_transform.py
in dpr_scale/transforms
27 2
Files With Long Lines (Top 3)

There are 3 files with lines longer than 120 characters. In total, there are 3 long lines.

File# lines# units# long lines
tune_spar_weights.py
in dpr_scale/utils
160 5 1
eval_dpr.py
in dpr_scale
180 20 1
msmarco_eval.py
in dpr_scale
155 11 1