openai / evals - File Size

Intro

File size measurements show the distribution of size of files.
Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.

Learn more...

File Size Overall

There are 1,499 files with 41,930 lines of code.

2 very long files (2,363 lines of code)
0 long files (0 lines of code)
41 medium size files (11,786 lines of codeclsfd_ftr_w_mp_ins)
75 small files (10,548 lines of code)
1,381 very small files (17,233 lines of code)

Legend:

1001+

501-1000

201-500

101-200

1-100

explore: grouped by folders | grouped by size | sunburst | 3D view

File Size per Extension

1001+

501-1000

201-500

101-200

1-100

File Size per Logical Decomposition

primary

1001+

501-1000

201-500

101-200

1-100

Longest Files (Top 50)

File	# lines	# units
generate_samples.ipynb in evals/registry/data/backgammon	1349	-
actions.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	1014	50
tools.py in evals/elsuite/bugged_tools	497	37
processors.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	495	21
record.py in evals	450	54
make_plots.py in evals/elsuite/error_recovery/scripts	446	20
session.py in evals/elsuite/multistep_web_tasks	416	28
mmlu.yaml in evals/registry/evals	399	-
theory_of_mind.yaml in evals/registry/solvers	394	-
mmmu.yaml in evals/registry/evals	390	-
make_plots.py in evals/elsuite/identifying_variables/scripts	325	17
gen_data.py in evals/elsuite/identifying_variables/scripts	319	13
eval.py in evals/elsuite/skill_acquisition	313	7
plot_experiments.py in evals/elsuite/hr_ml_agent_bench/scripts	307	-
low_level_actions.py in evals/elsuite/hr_ml_agent_bench	304	13
defaults.yaml in evals/registry/solvers	294	-
environment.py in evals/elsuite/hr_ml_agent_bench	283	21
constants.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	282	-
playwright_api.py in evals/elsuite/multistep_web_tasks/webarena/core	279	32
eval_run.py in evals/elsuite/multistep_web_tasks/webarena	277	14
evaluators.py in evals/elsuite/multistep_web_tasks/webarena/evaluation_harness	273	19
skill_acquisition.yaml in evals/registry/solvers	267	-
make_plots.py in evals/elsuite/already_said_that/scripts	263	8
oaieval.py in evals/cli	253	6
eval.py in evals/elsuite/incontext_rl	246	12
eval.py in evals/elsuite/function_deduction	244	14
registry.py in evals	242	27
dataset_creation.py in evals/elsuite/cant_do_that_anymore/scripts	235	8
make_plots.py in evals/elsuite/track_the_stat/scripts	235	9
make_plots.py in evals/elsuite/ballots/scripts	233	12
plot_experiments.py in evals/elsuite/incontext_rl/scripts	233	8
solve.py in evals/registry/data/solve-for-variable/tools	231	30
eval.py in evals/elsuite/identifying_variables	227	14
raven-matrices.yaml in evals/registry/evals	224	-
core.py in evals/elsuite/make_me_say	223	18
makemepay.py in evals/elsuite/make_me_pay	222	8
diagonal_dataset_creation.py in evals/elsuite/cant_do_that_anymore/scripts	216	6
corrset.py in evals/elsuite/identifying_variables/renderers	216	11
eval.py in evals/elsuite/self_prompting	210	7
eval.py in evals/elsuite/bugged_tools	210	9
cards.py in evals/elsuite/bluff/bluff	206	37
eval.py in evals/elsuite/error_recovery	204	9
pieces.py in evals/elsuite/cant_do_that_anymore/chess	203	9
custom_datasets.py in evals/elsuite/steganography/scripts/dataset	197	12
make_plots.py in evals/elsuite/function_deduction/scripts	195	4
basic_browser_env.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	191	11
high_level_actions.py in evals/elsuite/hr_ml_agent_bench	191	4
utils.py in evals/elsuite/multistep_web_tasks/webarena/core	188	7
openai_assistants_solver.py in evals/solvers/providers/openai	186	11
modelgraded_generator.py in scripts	185	1

Files With Most Units (Top 50)

File	# lines	# units
record.py in evals	450	54
actions.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	1014	50
cards.py in evals/elsuite/bluff/bluff	206	37
tools.py in evals/elsuite/bugged_tools	497	37
playwright_api.py in evals/elsuite/multistep_web_tasks/webarena/core	279	32
solve.py in evals/registry/data/solve-for-variable/tools	231	30
session.py in evals/elsuite/multistep_web_tasks	416	28
registry.py in evals	242	27
processors.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	495	21
environment.py in evals/elsuite/hr_ml_agent_bench	283	21
data.py in evals	148	21
make_plots.py in evals/elsuite/error_recovery/scripts	446	20
evaluators.py in evals/elsuite/multistep_web_tasks/webarena/evaluation_harness	273	19
openai_solver.py in evals/solvers/providers/openai	181	18
core.py in evals/elsuite/make_me_say	223	18
solver.py in evals/solvers	125	17
make_plots.py in evals/elsuite/identifying_variables/scripts	325	17
wave_function_collapse.py in evals/registry/data/simple_physics_engine	157	17
solvers.py in evals/elsuite/sandbagging	152	16
utils.py in evals/elsuite/skill_acquisition	115	15
corpus.py in evals/registry/data/word_association/corpus_tools	58	15
eval.py in evals	170	15
eval_run.py in evals/elsuite/multistep_web_tasks/webarena	277	14
tabular.py in evals/elsuite/identifying_variables/renderers	125	14
eval.py in evals/elsuite/identifying_variables	227	14
eval.py in evals/elsuite/function_deduction	244	14
basic_bash_env.py in evals/elsuite/multistep_web_tasks/webarena/bash_env	163	13
players.py in evals/elsuite/bluff/bluff	107	13
board.py in evals/elsuite/cant_do_that_anymore/chess	162	13
gen_data.py in evals/elsuite/identifying_variables/scripts	319	13
low_level_actions.py in evals/elsuite/hr_ml_agent_bench	304	13
solvers.py in evals/elsuite/track_the_stat	72	13
utils.py in evals/elsuite	150	13
related_words.py in evals/registry/data/word_association/corpus_tools	64	13
strong_solver.py in evals/elsuite/multistep_web_tasks/solvers/strong_solver	173	12
make_plots.py in evals/elsuite/ballots/scripts	233	12
utils.py in evals/elsuite/cant_do_that_anymore	178	12
custom_datasets.py in evals/elsuite/steganography/scripts/dataset	197	12
eval.py in evals/elsuite/incontext_rl	246	12
openai_assistants_solver.py in evals/solvers/providers/openai	186	11
basic_browser_env.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	191	11
corrset.py in evals/elsuite/identifying_variables/renderers	216	11
solver_tools_convo.py in evals/elsuite	181	11
validators.py in evals/registry/data/word_association/corpus_tools	151	11
together_solver.py in evals/solvers/providers/together	68	10
baselines.py in evals/elsuite/function_deduction	91	10
openai.py in evals/completion_fns	147	10
gemini_solver.py in evals/solvers/providers/google	157	9
eval.py in evals/elsuite/error_recovery	204	9
eval.py in evals/elsuite/bluff	164	9

Files With Long Lines (Top 50)

There are 132 files with lines longer than 120 characters. In total, there are 335 long lines.

File	# lines	# units	# long lines
raven-matrices.yaml in evals/registry/evals	224	-	28
tools.py in evals/elsuite/bugged_tools	497	37	26
hr-ml-agent-bench.yaml in evals/registry/evals	137	-	16
generate_samples.ipynb in evals/registry/data/backgammon	1349	-	15
high_level_actions.py in evals/elsuite/hr_ml_agent_bench	191	4	12
plot_experiments.py in evals/elsuite/incontext_rl/scripts	233	8	11
csv_to_json.py in evals/registry/data/canto_wu_pronunciation	55	-	8
theory_of_mind.yaml in evals/registry/evals	48	-	8
prompts.py in evals/elsuite/ballots	44	-	7
oaieval.py in evals/cli	253	6	7
task_description.py in evals/elsuite/make_me_pay	57	-	6
utils.py in evals/elsuite/twenty_questions	47	4	6
helper_functions.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	129	5	5
prompts.py in evals/elsuite/make_me_pay/solvers	18	-	5
low_level_actions.py in evals/elsuite/hr_ml_agent_bench	304	13	5
defaults.py in evals/elsuite/make_me_say	34	5	5
singlestore.yaml in evals/registry/modelgraded	24	-	5
fewshot_solver.py in evals/solvers/nested	91	5	4
session.py in evals/elsuite/multistep_web_tasks	416	28	4
sql.yaml in evals/registry/modelgraded	24	-	4
product-ie.yaml in evals/registry/evals	28	-	4
defaults.py in evals/elsuite/error_recovery	12	-	3
data_generation.py in evals/elsuite/theory_of_mind/scripts	66	1	3
eval.py in evals/elsuite/self_prompting	210	7	3
environment.py in evals/elsuite/hr_ml_agent_bench	283	21	3
utils.py in evals/elsuite/skill_acquisition	115	15	3
solvers.py in evals/elsuite/function_deduction	140	9	3
self_prompting.yaml in evals/registry/solvers	96	-	3
research-question-extraction.yaml in evals/registry/modelgraded	19	-	3
self_consistency_solver.py in evals/solvers/nested	118	6	2
actions.py in evals/elsuite/multistep_web_tasks/webarena/browser_env	1014	50	2
eval.py in evals/elsuite/make_me_pay	126	3	2
task_description.py in evals/elsuite/bugged_tools	9	-	2
solvers.py in evals/elsuite/identifying_variables	27	4	2
classify_utils.py in evals/elsuite/modelgraded	145	8	2
eval.py in evals/elsuite/skill_acquisition	313	7	2
eval.py in evals/elsuite/mmmu	159	4	2
convert.js in evals/registry/data/medmcqa	44	1	2
convert.js in evals/registry/data/unsolvable_questions	51	1	2
findFailures.js in evals/registry/data/unsolvable_questions	43	1	2
fact.yaml in evals/registry/modelgraded	19	-	2
keywords.yaml in evals/registry/modelgraded	20	-	2
arithmetic-expression.yaml in evals/registry/modelgraded	24	-	2
translation.yaml in evals/registry/modelgraded	19	-	2
abstract-causal-reasoning.yaml in evals/registry/evals	16	-	2
openai.py in evals/completion_fns	147	10	2
battle_generator.py in scripts	49	1	1
cot.py in evals/solvers/prompts	4	-	1
hhh.py in evals/solvers/prompts	99	-	1
openai_solver.py in evals/solvers/providers/openai	181	18	1

Correlations

File Size vs. Commits (all time): 1496 points

		1349.0	lines of code min: 1.0 average: 28.01 25th percentile: 3.0 median: 3.0 75th percentile: 16.0 max: 1349.0
0	32.0
commits (all time) min: 1.0 \| average: 1.29 \| 25th percentile: 1.0 \| median: 1.0 \| 75th percentile: 1.0 \| max: 32.0

File Size vs. Contributors (all time): 1496 points

		1349.0	lines of code min: 1.0 average: 28.01 25th percentile: 3.0 median: 3.0 75th percentile: 16.0 max: 1349.0
0	19.0
contributors (all time) min: 1.0 \| average: 1.19 \| 25th percentile: 1.0 \| median: 1.0 \| 75th percentile: 1.0 \| max: 19.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".

File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".