facebookresearch / simmc2
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show how the code base is distributed across files of different sizes.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
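
The classification above can be sketched in a few lines of Python. This is a minimal illustration, not the report generator's actual implementation: the names BUCKETS, classify and distribution are hypothetical, and truncating percentages to whole numbers is an assumption made so that the output matches the figures shown in this report.

```python
# Size buckets as listed in the intro: (label, lower bound, inclusive upper bound).
BUCKETS = [
    ("1001+", 1001, None),
    ("501-1000", 501, 1000),
    ("201-500", 201, 500),
    ("101-200", 101, 200),
    ("1-100", 1, 100),
]


def classify(loc):
    """Return the bucket label for a file with `loc` lines of code."""
    for label, low, high in BUCKETS:
        if loc >= low and (high is None or loc <= high):
            return label
    raise ValueError(f"line count must be at least 1, got {loc}")


def distribution(line_counts):
    """Return the share of total lines of code per bucket, in whole percent.

    Percentages are truncated, not rounded (an assumption about the report).
    """
    totals = {label: 0 for label, _, _ in BUCKETS}
    for loc in line_counts:
        totals[classify(loc)] += loc
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {label: 0 for label in totals}
    return {label: 100 * lines // grand_total for label, lines in totals.items()}


# Line counts of the 17 files listed in this report.
SIMMC2_LINE_COUNTS = [842, 279, 241, 225, 214, 159, 93, 92, 75,
                      70, 66, 61, 55, 50, 47, 25, 18]

if __name__ == "__main__":
    for label, percent in distribution(SIMMC2_LINE_COUNTS).items():
        print(f"{label}: {percent}%")
```

Note that the shares are weighted by lines of code, not by number of files: a single 842-line file accounts for roughly a third of the code base even though it is only one of 17 files.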
File Size Overall
  • There are 17 files with 2,612 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long file (842 lines of code)
    • 4 medium size files (959 lines of code)
    • 1 small file (159 lines of code)
    • 11 very small files (652 lines of code)
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 32% | 36% | 6% | 24%


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 32% | 36% | 6% | 24%
File Size per Logical Decomposition
Component (primary) | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
mm_dst/gpt2_dst/scripts | 0% | 54% | 32% | 0% | 12%
mm_dst/gpt2_dst/utils | 0% | 0% | 100% | 0% | 0%
mm_dst/utils | 0% | 0% | 100% | 0% | 0%
disambiguate | 0% | 0% | 0% | 54% | 45%
utils | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 17)
File | Location | # lines | # units
run_language_modeling.py | model/mm_dst/gpt2_dst/scripts | 842 | 16
run_generation.py | model/mm_dst/gpt2_dst/scripts | 279 | 7
convert.py | model/mm_dst/gpt2_dst/utils | 241 | 4
evaluate_dst.py | model/mm_dst/utils | 225 | 11
run_retrieval.py | model/mm_dst/gpt2_dst/scripts | 214 | 7
train_model.py | model/disambiguate | 159 | 2
response_evaluation.py | model/utils | 93 | 3
visualize_bboxes.py | model/utils | 92 | 3
preprocess_input.py | model/mm_dst/gpt2_dst/scripts | 75 | -
retrieval_evaluation.py | model/utils | 70 | 2
disambiguator_evaluation.py | model/utils | 66 | 2
format_disambiguation_data.py | model/disambiguate | 61 | 1
dataloader.py | model/disambiguate | 55 | 4
evaluate_response.py | model/mm_dst/gpt2_dst/scripts | 50 | 2
format_retrieval_results.py | model/mm_dst/gpt2_dst/scripts | 47 | 1
evaluate.py | model/mm_dst/gpt2_dst/scripts | 25 | -
disambiguator.py | model/disambiguate | 18 | 2
Files With Most Units (Top 15)
File | Location | # lines | # units
run_language_modeling.py | model/mm_dst/gpt2_dst/scripts | 842 | 16
evaluate_dst.py | model/mm_dst/utils | 225 | 11
run_generation.py | model/mm_dst/gpt2_dst/scripts | 279 | 7
run_retrieval.py | model/mm_dst/gpt2_dst/scripts | 214 | 7
convert.py | model/mm_dst/gpt2_dst/utils | 241 | 4
dataloader.py | model/disambiguate | 55 | 4
response_evaluation.py | model/utils | 93 | 3
visualize_bboxes.py | model/utils | 92 | 3
evaluate_response.py | model/mm_dst/gpt2_dst/scripts | 50 | 2
train_model.py | model/disambiguate | 159 | 2
disambiguator.py | model/disambiguate | 18 | 2
retrieval_evaluation.py | model/utils | 70 | 2
disambiguator_evaluation.py | model/utils | 66 | 2
format_retrieval_results.py | model/mm_dst/gpt2_dst/scripts | 47 | 1
format_disambiguation_data.py | model/disambiguate | 61 | 1
Files With Long Lines (Top 2)

There are 2 files with lines longer than 120 characters. In total, there are 10 long lines.

File | Location | # lines | # units | # long lines
run_language_modeling.py | model/mm_dst/gpt2_dst/scripts | 842 | 16 | 7
convert.py | model/mm_dst/gpt2_dst/utils | 241 | 4 | 3
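
A long-line count like the one above can be approximated with a short Python helper. This is a sketch under stated assumptions, not the tool's own logic: the function names are hypothetical, the 120-character limit is taken from this report, and line length is assumed to be measured without the trailing newline.

```python
from pathlib import Path

# Threshold used in this report: lines longer than 120 characters are "long".
LONG_LINE_LIMIT = 120


def count_long_lines(text, limit=LONG_LINE_LIMIT):
    """Count lines in `text` whose length exceeds `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def long_lines_in_file(path, limit=LONG_LINE_LIMIT):
    """Count long lines in the file at `path` (read as UTF-8, tolerating bad bytes)."""
    text = Path(path).read_text(encoding="utf-8", errors="replace")
    return count_long_lines(text, limit)
```

For example, calling long_lines_in_file on each .py file under model/ and keeping the non-zero results would yield a table of the same shape as the one above; the exact counts depend on how the tool treats whitespace and comments, which this report does not state.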