facebookresearch / PAQ
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 32 files with 2,259 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 1 medium size files (454 lines of codeclsfd_ftr_w_mp_ins)
    • 9 small files (1,262 lines of code)
    • 22 very small files (543 lines of code)
0% | 0% | 20% | 55% | 24%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 20% | 55% | 24%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
paq0% | 0% | 74% | 25% | <1%
paq/generation/answer_extractor0% | 0% | 0% | 88% | 11%
paq/retrievers0% | 0% | 0% | 65% | 34%
paq/generation/filtering0% | 0% | 0% | 82% | 17%
paq/rerankers0% | 0% | 0% | 99% | <1%
paq/generation0% | 0% | 0% | 99% | <1%
paq/generation/question_generator0% | 0% | 0% | 78% | 21%
paq/generation/passage_scorer0% | 0% | 0% | 0% | 100%
paq/server0% | 0% | 0% | 0% | 100%
paq/evaluation0% | 0% | 0% | 0% | 100%
Longest Files (Top 32)
File# lines# units
download.py
in paq
454 7
filterer.py
in paq/generation/filtering
199 18
span2D_model.py
in paq/generation/answer_extractor
153 5
paq_utils.py
in paq
152 15
build_index.py
in paq/retrievers
135 4
retrieve.py
in paq/retrievers
131 8
rerank.py
in paq/rerankers
129 6
generate_qa_pairs.py
in paq/generation
125 9
generator.py
in paq/generation/question_generator
124 5
extractors.py
in paq/generation/answer_extractor
114 9
embed.py
in paq/retrievers
97 2
server.py
in paq/server
81 3
scorer.py
in paq/generation/passage_scorer
78 11
retriever_utils.py
in paq/retrievers
45 6
filter_questions.py
in paq/generation/filtering
42 3
score_passages.py
in paq/generation/passage_scorer
35 3
extract_answers.py
in paq/generation/answer_extractor
35 3
generate_questions.py
in paq/generation/question_generator
33 2
eval_retriever.py
in paq/evaluation
30 1
eval_reranker.py
in paq/evaluation
26 1
eval_utils.py
in paq/evaluation
25 3
client.py
in paq/server
6 -
__init__.py
in paq/retrievers
1 -
__init__.py
in paq/rerankers
1 -
__init__.py
in paq
1 -
__init__.py
in paq/server
1 -
__init__.py
in paq/evaluation
1 -
__init__.py
in paq/generation/filtering
1 -
__init__.py
in paq/generation
1 -
__init__.py
in paq/generation/question_generator
1 -
__init__.py
in paq/generation/passage_scorer
1 -
__init__.py
in paq/generation/answer_extractor
1 -
Files With Most Units (Top 20)
File# lines# units
filterer.py
in paq/generation/filtering
199 18
paq_utils.py
in paq
152 15
scorer.py
in paq/generation/passage_scorer
78 11
extractors.py
in paq/generation/answer_extractor
114 9
generate_qa_pairs.py
in paq/generation
125 9
retrieve.py
in paq/retrievers
131 8
download.py
in paq
454 7
retriever_utils.py
in paq/retrievers
45 6
rerank.py
in paq/rerankers
129 6
generator.py
in paq/generation/question_generator
124 5
span2D_model.py
in paq/generation/answer_extractor
153 5
build_index.py
in paq/retrievers
135 4
server.py
in paq/server
81 3
eval_utils.py
in paq/evaluation
25 3
filter_questions.py
in paq/generation/filtering
42 3
score_passages.py
in paq/generation/passage_scorer
35 3
extract_answers.py
in paq/generation/answer_extractor
35 3
embed.py
in paq/retrievers
97 2
generate_questions.py
in paq/generation/question_generator
33 2
eval_retriever.py
in paq/evaluation
30 1
Files With Long Lines (Top 12)

There are 12 files with lines longer than 120 characters. In total, there are 33 long lines.

File# lines# units# long lines
retrieve.py
in paq/retrievers
131 8 7
rerank.py
in paq/rerankers
129 6 7
embed.py
in paq/retrievers
97 2 6
generate_qa_pairs.py
in paq/generation
125 9 3
download.py
in paq
454 7 2
generate_questions.py
in paq/generation/question_generator
33 2 2
retriever_utils.py
in paq/retrievers
45 6 1
eval_retriever.py
in paq/evaluation
30 1 1
eval_reranker.py
in paq/evaluation
26 1 1
filter_questions.py
in paq/generation/filtering
42 3 1
score_passages.py
in paq/generation/passage_scorer
35 3 1
extract_answers.py
in paq/generation/answer_extractor
35 3 1