openai / evals
File Age & Freshness

File age measurements show the distribution of file ages (days since the first commit) and the file freshness (days since the latest commit).

Summary
File Change History Overall
File Age Distribution Overall
Days since first update
  • There are 1,496 files with 41,910 lines of code in files.
    • 1,494 files that are 366+ days old (41,899 lines of code)
    • 2 files that are 181-365 days old (11 lines of code)
    • 0 files that are 91-180 days old (0 lines of code)
    • 0 files that are 31-90 days old (0 lines of code)
    • 0 files that are 1-30 days old (0 lines of code)
99% | <1% | 0% | 0% | 0%
Legend:
366+
181-365
91-180
31-90
1-30

explore: grouped by folders | grouped by age
File Freshness Distribution Overall
Days since last update
  • There are 1,496 files with 41,910 lines of code in files.
    • 1,493 files have been last changed 366+ days ago (41,646 lines of code)
    • 3 files have been last changed 181-365 days ago (264 lines of code)
    • 0 files have been last changed 91-180 days ago (0 lines of code)
    • 0 files have been last changed 31-90 days ago (0 lines of code)
    • 0 files have been last changed 1-30 days ago (0 lines of code)
99% | <1% | 0% | 0% | 0%
Legend:
366+
181-365
91-180
31-90
1-30

explore: grouped by folders | grouped by freshness
File Change History per File Extension
jsonl, yaml, py, md, txt, sh, json, gitignore, ipynb, gitattributes, html, js, in, ini, toml
File Age Distribution per Extension
Days since first update
366+
181-365
91-180
31-90
1-30
py100% | 0% | 0% | 0% | 0%
yaml99% | <1% | 0% | 0% | 0%
jsonl99% | <1% | 0% | 0% | 0%
ipynb100% | 0% | 0% | 0% | 0%
html100% | 0% | 0% | 0% | 0%
js100% | 0% | 0% | 0% | 0%
toml100% | 0% | 0% | 0% | 0%
in100% | 0% | 0% | 0% | 0%
File Freshness Distribution per Extension
Days since last update
366+
181-365
91-180
31-90
1-30
py99% | <1% | 0% | 0% | 0%
yaml99% | <1% | 0% | 0% | 0%
jsonl99% | <1% | 0% | 0% | 0%
ipynb100% | 0% | 0% | 0% | 0%
html100% | 0% | 0% | 0% | 0%
js100% | 0% | 0% | 0% | 0%
toml100% | 0% | 0% | 0% | 0%
in100% | 0% | 0% | 0% | 0%
File Change History per Logical Decomposition
primary
primary (file age distribution)
Days since first update
366+
181-365
91-180
31-90
1-30
evals99% | <1% | 0% | 0% | 0%
scripts100% | 0% | 0% | 0% | 0%
ROOT100% | 0% | 0% | 0% | 0%
primary (file freshness distribution)
Days since last update
366+
181-365
91-180
31-90
1-30
evals99% | <1% | 0% | 0% | 0%
scripts100% | 0% | 0% | 0% | 0%
ROOT100% | 0% | 0% | 0% | 0%
Oldest Files (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
jsonl
reverse_string.jsonl
in evals/registry/data/reverse_string
3 - 2023-03-14 2023-03-14 1 2 120423412+andrew-openai@use... logan@openai.com
reverse-string.yaml
in evals/registry/evals
3 - 2023-03-14 2023-03-14 1 2 120423412+andrew-openai@use... logan@openai.com
48 3 2023-03-15 2024-01-10 6 6 jasonwei@openai.com 140545726+ianmckenzie-oai@u...
7 - 2023-03-15 2023-03-16 2 2 omattos@gmail.com 120423412+andrew-openai@use...
cube-pack.yaml
in evals/registry/evals
7 - 2023-03-15 2023-03-16 2 2 emil@radix.ai 120423412+andrew-openai@use...
pattern_identification.yaml
in evals/registry/evals
7 - 2023-03-15 2023-03-16 2 2 jasonwei@openai.com 120423412+andrew-openai@use...
jsonl
samples.jsonl
in evals/registry/data/map-electronic-component-part-to-fact
3 - 2023-03-15 2023-03-16 2 2 omattos@gmail.com 120423412+andrew-openai@use...
jsonl
samples.jsonl
in evals/registry/data/cube-pack
3 - 2023-03-15 2023-03-16 2 2 emil@radix.ai emil@radix.ai
jsonl
samples.v0.jsonl
in evals/registry/data/pattern_identification
3 - 2023-03-15 2023-03-16 2 2 jasonwei@openai.com jasonwei@openai.com
jsonl
born_first.jsonl
in evals/registry/data/born_first
3 - 2023-03-15 2023-03-16 2 2 njbbaer@gmail.com 120423412+andrew-openai@use...
born-first.yaml
in evals/registry/evals
3 - 2023-03-15 2023-03-16 2 2 njbbaer@gmail.com 120423412+andrew-openai@use...
record.py
in evals
450 54 2023-03-16 2024-01-26 13 9 shane@openai.com 140545726+ianmckenzie-oai@u...
185 1 2023-03-16 2024-01-10 6 5 shane@openai.com 140545726+ianmckenzie-oai@u...
classify.py
in evals/elsuite/modelgraded
97 3 2023-03-16 2023-09-26 19 7 shane@openai.com bomarni@googlemail.com
49 1 2023-03-16 2024-01-10 6 5 shane@openai.com 140545726+ianmckenzie-oai@u...
test-modelgraded-battle.yaml
in evals/registry/evals
36 - 2023-03-16 2023-04-24 5 3 shane@openai.com 343165+rlbayes@users.norepl...
test-all.yaml
in evals/registry/eval_sets
21 - 2023-03-16 2023-04-24 5 3 shane@openai.com 343165+rlbayes@users.norepl...
test-modelgraded-generated.yaml
in evals/registry/evals
9 - 2023-03-16 2023-03-27 3 2 shane@openai.com 343165+rlbayes@users.norepl...
balance-chemical-equation.yaml
in evals/registry/evals
7 - 2023-03-16 2023-03-16 1 2 120423412+andrew-openai@use... scruelt@hotmail.com
jsonl
fuzzy_match.jsonl
in evals/registry/data/chess_piece_count
3 - 2023-03-16 2023-03-16 1 2 120423412+andrew-openai@use... jatinparab98@gmail.com
jsonl
samples.jsonl
in evals/registry/data/balance_chemical_equation
3 - 2023-03-16 2023-05-21 2 2 120423412+andrew-openai@use... scruelt@hotmail.com
chess-piece-count.yaml
in evals/registry/evals
3 - 2023-03-16 2023-03-16 1 2 120423412+andrew-openai@use... jatinparab98@gmail.com
data.py
in evals
148 21 2023-03-17 2024-04-02 9 10 343165+rlbayes@users.norepl... 150190178+josnyder-2@users....
coqa-ex.yaml
in evals/registry/evals
55 - 2023-03-17 2023-04-24 4 3 343165+rlbayes@users.norepl... 343165+rlbayes@users.norepl...
test-modelgraded.yaml
in evals/registry/eval_sets
14 - 2023-03-17 2023-04-06 3 2 343165+rlbayes@users.norepl... 343165+rlbayes@users.norepl...
coqa-ex.yaml
in evals/registry/eval_sets
7 - 2023-03-17 2023-04-24 2 2 343165+rlbayes@users.norepl... 343165+rlbayes@users.norepl...
oaieval.py
in evals/cli
253 6 2023-03-18 2024-09-30 19 18 120423412+andrew-openai@use... steven@openai.com
api_utils.py
in evals/utils
15 1 2023-03-18 2024-03-26 10 8 120423412+andrew-openai@use... oliver.jaffe@hotmail.co.uk
eval.py
in evals
170 15 2023-03-20 2024-03-28 9 9 2406911+zhangmarvin@users.n... oliver.jaffe@hotmail.co.uk
test-modelgraded.yaml
in evals/registry/evals
81 - 2023-03-20 2023-09-18 7 3 343165+rlbayes@users.norepl... 55913678+cholotook@users.no...
base.py
in evals
51 1 2023-03-20 2023-09-26 5 4 343165+rlbayes@users.norepl... lukevanseters@gmail.com
best.yaml
in evals/registry/modelgraded
10 - 2023-03-20 2023-09-18 4 2 343165+rlbayes@users.norepl... 55913678+cholotook@users.no...
bigrams.yaml
in evals/registry/evals
7 - 2023-03-20 2023-03-20 1 1 oscar-king@users.noreply.gi... oscar-king@users.noreply.gi...
lat_long_identify.yaml
in evals/registry/evals
7 - 2023-03-20 2023-03-20 1 1 vishaal16119@iiitd.ac.in vishaal16119@iiitd.ac.in
jsonl
infiniteloop-match.jsonl
in evals/registry/data/infiniteloop-match
3 - 2023-03-20 2023-03-20 1 1 44745172+dottedant-dooz@use... 44745172+dottedant-dooz@use...
jsonl
samples.jsonl
in evals/registry/data/bigrams
3 - 2023-03-20 2023-03-20 1 1 oscar-king@users.noreply.gi... oscar-king@users.noreply.gi...
jsonl
samples.jsonl
in evals/registry/data/reasoning
3 - 2023-03-20 2023-03-20 1 1 oscar-king@users.noreply.gi... oscar-king@users.noreply.gi...
jsonl
samples.jsonl
in evals/registry/data/lat_long_identify
3 - 2023-03-20 2023-03-20 1 1 vishaal16119@iiitd.ac.in vishaal16119@iiitd.ac.in
jsonl
samples.jsonl
in evals/registry/data/last_word_nth
3 - 2023-03-20 2023-03-20 1 1 trevor.annedenise@icloud.com trevor.annedenise@icloud.com
infiniteloop-match.yaml
in evals/registry/evals
3 - 2023-03-20 2023-03-20 1 1 44745172+dottedant-dooz@use... 44745172+dottedant-dooz@use...
last-word-nth.yaml
in evals/registry/evals
3 - 2023-03-20 2023-03-20 1 1 trevor.annedenise@icloud.com trevor.annedenise@icloud.com
registry.py
in evals
242 27 2023-03-21 2024-03-28 22 17 343165+rlbayes@users.norepl... oliver.jaffe@hotmail.co.uk
oaievalset.py
in evals/cli
111 8 2023-03-21 2024-01-10 5 4 343165+rlbayes@users.norepl... 140545726+ianmckenzie-oai@u...
64 - 2023-03-21 2024-05-01 20 19 1520816+andremafei@users.no... erik.t.ritter@gmail.com
anagrams.yaml
in evals/registry/evals
10 - 2023-03-21 2023-03-21 1 1 97240159+l-mutricy@users.no... 97240159+l-mutricy@users.no...
test-comp-sci.yaml
in evals/registry/evals
9 - 2023-03-21 2023-03-21 1 1 samennis127@gmail.com samennis127@gmail.com
convert-hex-hsl-lightness.yaml
in evals/registry/evals
8 - 2023-03-21 2023-03-21 1 1 harley@hturan.com harley@hturan.com
complex-replace-characters.yaml
in evals/registry/evals
8 - 2023-03-21 2023-03-21 1 1 petrgazarov@gmail.com petrgazarov@gmail.com
decrypt-caesar-cipher.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 me@mattfalconer.com me@mattfalconer.com
connect-4.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 83553535+kierandon@users.no... 83553535+kierandon@users.no...
Files Not Recently Changed (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
reverse-string.yaml
in evals/registry/evals
3 - 2023-03-14 2023-03-14 1 2 120423412+andrew-openai@use... logan@openai.com
jsonl
reverse_string.jsonl
in evals/registry/data/reverse_string
3 - 2023-03-14 2023-03-14 1 2 120423412+andrew-openai@use... logan@openai.com
born-first.yaml
in evals/registry/evals
3 - 2023-03-15 2023-03-16 2 2 njbbaer@gmail.com 120423412+andrew-openai@use...
chess-piece-count.yaml
in evals/registry/evals
3 - 2023-03-16 2023-03-16 1 2 120423412+andrew-openai@use... jatinparab98@gmail.com
jsonl
born_first.jsonl
in evals/registry/data/born_first
3 - 2023-03-15 2023-03-16 2 2 njbbaer@gmail.com 120423412+andrew-openai@use...
jsonl
samples.v0.jsonl
in evals/registry/data/pattern_identification
3 - 2023-03-15 2023-03-16 2 2 jasonwei@openai.com jasonwei@openai.com
jsonl
samples.jsonl
in evals/registry/data/cube-pack
3 - 2023-03-15 2023-03-16 2 2 emil@radix.ai emil@radix.ai
jsonl
fuzzy_match.jsonl
in evals/registry/data/chess_piece_count
3 - 2023-03-16 2023-03-16 1 2 120423412+andrew-openai@use... jatinparab98@gmail.com
jsonl
samples.jsonl
in evals/registry/data/map-electronic-component-part-to-fact
3 - 2023-03-15 2023-03-16 2 2 omattos@gmail.com 120423412+andrew-openai@use...
balance-chemical-equation.yaml
in evals/registry/evals
7 - 2023-03-16 2023-03-16 1 2 120423412+andrew-openai@use... scruelt@hotmail.com
pattern_identification.yaml
in evals/registry/evals
7 - 2023-03-15 2023-03-16 2 2 jasonwei@openai.com 120423412+andrew-openai@use...
cube-pack.yaml
in evals/registry/evals
7 - 2023-03-15 2023-03-16 2 2 emil@radix.ai 120423412+andrew-openai@use...
7 - 2023-03-15 2023-03-16 2 2 omattos@gmail.com 120423412+andrew-openai@use...
last-word-nth.yaml
in evals/registry/evals
3 - 2023-03-20 2023-03-20 1 1 trevor.annedenise@icloud.com trevor.annedenise@icloud.com
infiniteloop-match.yaml
in evals/registry/evals
3 - 2023-03-20 2023-03-20 1 1 44745172+dottedant-dooz@use... 44745172+dottedant-dooz@use...
jsonl
samples.jsonl
in evals/registry/data/last_word_nth
3 - 2023-03-20 2023-03-20 1 1 trevor.annedenise@icloud.com trevor.annedenise@icloud.com
jsonl
samples.jsonl
in evals/registry/data/lat_long_identify
3 - 2023-03-20 2023-03-20 1 1 vishaal16119@iiitd.ac.in vishaal16119@iiitd.ac.in
jsonl
samples.jsonl
in evals/registry/data/reasoning
3 - 2023-03-20 2023-03-20 1 1 oscar-king@users.noreply.gi... oscar-king@users.noreply.gi...
jsonl
samples.jsonl
in evals/registry/data/bigrams
3 - 2023-03-20 2023-03-20 1 1 oscar-king@users.noreply.gi... oscar-king@users.noreply.gi...
jsonl
infiniteloop-match.jsonl
in evals/registry/data/infiniteloop-match
3 - 2023-03-20 2023-03-20 1 1 44745172+dottedant-dooz@use... 44745172+dottedant-dooz@use...
lat_long_identify.yaml
in evals/registry/evals
7 - 2023-03-20 2023-03-20 1 1 vishaal16119@iiitd.ac.in vishaal16119@iiitd.ac.in
bigrams.yaml
in evals/registry/evals
7 - 2023-03-20 2023-03-20 1 1 oscar-king@users.noreply.gi... oscar-king@users.noreply.gi...
chess.yaml
in evals/registry/evals
3 - 2023-03-21 2023-03-21 1 1 t.zehle@gmail.com t.zehle@gmail.com
belarusian-lexicon.yaml
in evals/registry/evals
3 - 2023-03-21 2023-03-21 1 1 50818265+somerandomguyonthe... 50818265+somerandomguyonthe...
jsonl
samples.jsonl
in evals/registry/data/aba_mrpc_true_false
3 - 2023-03-21 2023-03-21 1 1 avery@offerfit.ai avery@offerfit.ai
jsonl
samples.jsonl
in evals/registry/data/complex_replace_characters
3 - 2023-03-21 2023-03-21 1 1 petrgazarov@gmail.com petrgazarov@gmail.com
jsonl
samples.jsonl
in evals/registry/data/crepe
3 - 2023-03-21 2023-03-21 1 1 46582003+seacowx@users.nore... 46582003+seacowx@users.nore...
jsonl
samples.jsonl
in evals/registry/data/connect4
3 - 2023-03-21 2023-03-21 1 1 83553535+kierandon@users.no... 83553535+kierandon@users.no...
jsonl
samples.jsonl
in evals/registry/data/belarusian_lexicon
3 - 2023-03-21 2023-03-21 1 1 50818265+somerandomguyonthe... 50818265+somerandomguyonthe...
jsonl
fewshot.jsonl
in evals/registry/data/anagrams
3 - 2023-03-21 2023-03-21 1 1 97240159+l-mutricy@users.no... 97240159+l-mutricy@users.no...
jsonl
samples.jsonl
in evals/registry/data/anagrams
3 - 2023-03-21 2023-03-21 1 1 97240159+l-mutricy@users.no... 97240159+l-mutricy@users.no...
jsonl
samples.jsonl
in evals/registry/data/determinant
3 - 2023-03-21 2023-03-21 1 1 80975912+vitoraqdev@users.n... 80975912+vitoraqdev@users.n...
jsonl
match.jsonl
in evals/registry/data/chess
3 - 2023-03-21 2023-03-21 1 1 t.zehle@gmail.com t.zehle@gmail.com
jsonl
samples.jsonl
in evals/registry/data/decrypt_caesar_cipher
3 - 2023-03-21 2023-03-21 1 1 me@mattfalconer.com me@mattfalconer.com
jsonl
samples.jsonl
in evals/registry/data/convert-hex-hsl-lightness
3 - 2023-03-21 2023-03-21 1 1 harley@hturan.com harley@hturan.com
determinant.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 80975912+vitoraqdev@users.n... 80975912+vitoraqdev@users.n...
crepe.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 46582003+seacowx@users.nore... 46582003+seacowx@users.nore...
aba-mrpc-true-false.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 avery@offerfit.ai avery@offerfit.ai
connect-4.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 83553535+kierandon@users.no... 83553535+kierandon@users.no...
decrypt-caesar-cipher.yaml
in evals/registry/evals
7 - 2023-03-21 2023-03-21 1 1 me@mattfalconer.com me@mattfalconer.com
complex-replace-characters.yaml
in evals/registry/evals
8 - 2023-03-21 2023-03-21 1 1 petrgazarov@gmail.com petrgazarov@gmail.com
convert-hex-hsl-lightness.yaml
in evals/registry/evals
8 - 2023-03-21 2023-03-21 1 1 harley@hturan.com harley@hturan.com
test-comp-sci.yaml
in evals/registry/evals
9 - 2023-03-21 2023-03-21 1 1 samennis127@gmail.com samennis127@gmail.com
anagrams.yaml
in evals/registry/evals
10 - 2023-03-21 2023-03-21 1 1 97240159+l-mutricy@users.no... 97240159+l-mutricy@users.no...
formal_logic.yaml
in evals/registry/evals
3 - 2023-03-22 2023-03-22 1 1 christopher@wolfram.com christopher@wolfram.com
jsonl
samples.jsonl
in evals/registry/data/forth_stack_sim
3 - 2023-03-22 2023-03-22 1 1 gooseus@users.noreply.githu... gooseus@users.noreply.githu...
jsonl
samples.jsonl
in evals/registry/data/first-letters
3 - 2023-03-22 2023-03-22 1 1 67751757+kallyaleksiev@user... 67751757+kallyaleksiev@user...
jsonl
samples.jsonl
in evals/registry/data/diagrammatic_logic
3 - 2023-03-22 2023-03-22 1 1 freddie.nicholson123@gmail.com freddie.nicholson123@gmail.com
jsonl
formal_logic_expressions.jsonl
in evals/registry/data/formal_logic
3 - 2023-03-22 2023-03-22 1 1 christopher@wolfram.com christopher@wolfram.com
jsonl
full_samples.jsonl
in evals/registry/data/poker_hand_ranks
3 - 2023-03-22 2023-03-22 1 1 54050333+msilva-00@users.no... 54050333+msilva-00@users.no...
Most Recently Created Files (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
14 1
jsonl
match.jsonl
in evals/registry/data/coqa
3 -
jsonl
samples.jsonl
in evals/registry/data/coqa
3 -
imo_exact_answers.yaml
in evals/registry/evals
8 - 2024-07-13 2024-07-13 1 1 justin@lin.bot justin@lin.bot
jsonl
samples.jsonl
in evals/registry/data/imo_exact_answers
3 - 2024-07-13 2024-07-13 1 1 justin@lin.bot justin@lin.bot
openai_assistants_solver.py
in evals/solvers/providers/openai
186 11 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
openai_solver.py
in evals/solvers/providers/openai
181 18 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
together_solver.py
in evals/solvers/providers/together
68 10 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
memory.py
in evals/solvers
50 3 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
solver_completion_fn.py
in evals/completion_fns
47 4 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
gemini_solver.py
in evals/solvers/providers/google
157 9 2024-03-26 2024-03-26 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
gemini.yaml
in evals/registry/solvers
15 - 2024-03-26 2024-03-26 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
together.yaml
in evals/registry/solvers
85 - 2024-03-22 2024-03-28 2 2 giulio.starace@gmail.com oliver.jaffe@hotmail.co.uk
actions.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
1014 50 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
processors.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
495 21 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
session.py
in evals/elsuite/multistep_web_tasks
416 28 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
plot_experiments.py
in evals/elsuite/hr_ml_agent_bench/scripts
307 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
low_level_actions.py
in evals/elsuite/hr_ml_agent_bench
304 13 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
environment.py
in evals/elsuite/hr_ml_agent_bench
283 21 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
constants.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
282 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
playwright_api.py
in evals/elsuite/multistep_web_tasks/webarena/core
279 32 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
eval_run.py
in evals/elsuite/multistep_web_tasks/webarena
277 14 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
evaluators.py
in evals/elsuite/multistep_web_tasks/webarena/evaluation_harness
273 19 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
basic_browser_env.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
191 11 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
high_level_actions.py
in evals/elsuite/hr_ml_agent_bench
191 4 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
utils.py
in evals/elsuite/multistep_web_tasks/webarena/core
188 7 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
strong_solver.py
in evals/elsuite/multistep_web_tasks/solvers/strong_solver
173 12 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
autoeval.py
in evals/elsuite/hr_ml_agent_bench
172 2 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
app.py
in evals/elsuite/multistep_web_tasks/docker/flask-playwright
165 8 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
basic_bash_env.py
in evals/elsuite/multistep_web_tasks/webarena/bash_env
163 13 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
hr-ml-agent-bench.yaml
in evals/registry/evals
137 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
helper_functions.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
129 5 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
train.py
in evals/elsuite/hr_ml_agent_bench/benchmarks/ogbn_arxiv/env
126 5 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
train.py
in evals/elsuite/hr_ml_agent_bench/benchmarks/parkinsons_disease/env
124 2 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
train.py
in evals/elsuite/hr_ml_agent_bench/benchmarks/vectorization/env
118 5 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
utils.py
in evals/elsuite/hr_ml_agent_bench
112 8 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
helper_functions.py
in evals/elsuite/multistep_web_tasks/webarena/evaluation_harness
110 7 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
index.html
in evals/elsuite/multistep_web_tasks/docker/homepage/templates
108 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
calculator.html
in evals/elsuite/multistep_web_tasks/docker/homepage/templates
106 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
scratchpad.html
in evals/elsuite/multistep_web_tasks/docker/homepage/templates
105 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
train.py
in evals/elsuite/hr_ml_agent_bench/benchmarks/cifar10/env
105 4 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
auto_login.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
100 3 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
prepare.py
in evals/elsuite/hr_ml_agent_bench/benchmarks/parkinsons_disease/scripts
98 1 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
webarena_solvers.py
in evals/elsuite/multistep_web_tasks/solvers/webarena_solvers
94 8 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
make_plots.py
in evals/elsuite/multistep_web_tasks/reproducibility
94 6 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
eval.py
in evals/elsuite/hr_ml_agent_bench
91 5 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
baseline.py
in evals/elsuite/hr_ml_agent_bench/solvers
90 3 2024-03-21 2024-03-28 2 2 danesherbs@users.noreply.gi... oliver.jaffe@hotmail.co.uk
anthropic.yaml
in evals/registry/solvers
90 - 2024-03-21 2024-03-21 1 1 giulio.starace@gmail.com giulio.starace@gmail.com
anthropic_solver.py
in evals/solvers/providers/anthropic
89 7 2024-03-21 2024-03-26 2 2 giulio.starace@gmail.com oliver.jaffe@hotmail.co.uk
bash_browser_env.py
in evals/elsuite/multistep_web_tasks/webarena/bash_browser_env
89 7 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
Most Recently Changed Files (Top 50)
File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
14 1
jsonl
match.jsonl
in evals/registry/data/coqa
3 -
jsonl
samples.jsonl
in evals/registry/data/coqa
3 -
oaieval.py
in evals/cli
253 6 2023-03-18 2024-09-30 19 18 120423412+andrew-openai@use... steven@openai.com
imo_exact_answers.yaml
in evals/registry/evals
8 - 2024-07-13 2024-07-13 1 1 justin@lin.bot justin@lin.bot
jsonl
samples.jsonl
in evals/registry/data/imo_exact_answers
3 - 2024-07-13 2024-07-13 1 1 justin@lin.bot justin@lin.bot
64 - 2023-03-21 2024-05-01 20 19 1520816+andremafei@users.no... erik.t.ritter@gmail.com
data.py
in evals
148 21 2023-03-17 2024-04-02 9 10 343165+rlbayes@users.norepl... 150190178+josnyder-2@users....
theory_of_mind.yaml
in evals/registry/solvers
394 - 2024-01-29 2024-03-28 3 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
defaults.yaml
in evals/registry/solvers
294 - 2024-01-29 2024-03-28 3 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
skill_acquisition.yaml
in evals/registry/solvers
267 - 2024-03-19 2024-03-28 2 2 inwaves@users.noreply.githu... oliver.jaffe@hotmail.co.uk
registry.py
in evals
242 27 2023-03-21 2024-03-28 22 17 343165+rlbayes@users.norepl... oliver.jaffe@hotmail.co.uk
openai_assistants_solver.py
in evals/solvers/providers/openai
186 11 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
openai_solver.py
in evals/solvers/providers/openai
181 18 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
function_deduction.yaml
in evals/registry/solvers
174 - 2024-03-19 2024-03-28 2 2 129281094+james-aung@users.... oliver.jaffe@hotmail.co.uk
eval.py
in evals
170 15 2023-03-20 2024-03-28 9 9 2406911+zhangmarvin@users.n... oliver.jaffe@hotmail.co.uk
solvers.py
in evals/elsuite/function_deduction
140 9 2024-03-19 2024-03-28 2 2 129281094+james-aung@users.... oliver.jaffe@hotmail.co.uk
solver.py
in evals/solvers
125 17 2023-11-09 2024-03-28 4 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
self_consistency_solver.py
in evals/solvers/nested
118 6 2024-01-29 2024-03-28 3 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
make-me-pay.yaml
in evals/registry/solvers
101 - 2024-01-29 2024-03-28 2 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
self_prompting.yaml
in evals/registry/solvers
96 - 2024-01-29 2024-03-28 2 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
baseline.py
in evals/elsuite/hr_ml_agent_bench/solvers
90 3 2024-03-21 2024-03-28 2 2 danesherbs@users.noreply.gi... oliver.jaffe@hotmail.co.uk
strategy_solver.py
in evals/elsuite/bluff
88 5 2023-11-15 2024-03-28 3 3 33967107+johny-b@users.nore... oliver.jaffe@hotmail.co.uk
together.yaml
in evals/registry/solvers
85 - 2024-03-22 2024-03-28 2 2 giulio.starace@gmail.com oliver.jaffe@hotmail.co.uk
bluff.yaml
in evals/registry/solvers
80 - 2024-01-29 2024-03-28 2 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
track_the_stat.yaml
in evals/registry/solvers
75 - 2024-03-19 2024-03-28 2 2 giulio.starace@gmail.com oliver.jaffe@hotmail.co.uk
twenty_questions.yaml
in evals/registry/solvers
75 - 2024-03-19 2024-03-28 2 2 inwaves@users.noreply.githu... oliver.jaffe@hotmail.co.uk
already_said_that.yaml
in evals/registry/solvers
75 - 2024-03-19 2024-03-28 2 2 giulio.starace@gmail.com oliver.jaffe@hotmail.co.uk
together_solver.py
in evals/solvers/providers/together
68 10 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
cot_solver.py
in evals/solvers/nested
61 7 2024-01-29 2024-03-28 3 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
memory.py
in evals/solvers
50 3 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
eval.py
in evals/elsuite/make_me_say
48 3 2023-09-19 2024-03-28 3 2 140545726+ianmckenzie-oai@u... oliver.jaffe@hotmail.co.uk
solver_completion_fn.py
in evals/completion_fns
47 4 2024-03-28 2024-03-28 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
utils.py
in evals/solvers
37 2 2023-11-09 2024-03-28 4 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
hr-ml-agent-bench.yaml
in evals/registry/solvers
37 - 2024-03-21 2024-03-28 2 2 danesherbs@users.noreply.gi... oliver.jaffe@hotmail.co.uk
error_recovery.yaml
in evals/registry/solvers
33 - 2024-03-19 2024-03-28 2 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
incontext_rl.yaml
in evals/registry/solvers
24 - 2024-03-19 2024-03-28 2 2 129281094+james-aung@users.... oliver.jaffe@hotmail.co.uk
cant_do_that_anymore.yaml
in evals/registry/solvers
16 - 2024-03-19 2024-03-28 2 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
task_description.py
in evals/elsuite/bugged_tools
9 - 2024-03-19 2024-03-28 2 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
prompts.py
in evals/elsuite/function_deduction
6 - 2024-03-19 2024-03-28 2 2 129281094+james-aung@users.... oliver.jaffe@hotmail.co.uk
cot.py
in evals/solvers/prompts
4 - 2023-11-09 2024-03-28 2 2 junshern@users.noreply.gith... oliver.jaffe@hotmail.co.uk
gemini_solver.py
in evals/solvers/providers/google
157 9 2024-03-26 2024-03-26 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
openai.py
in evals/completion_fns
147 10 2023-04-11 2024-03-26 5 6 jwang47@users.noreply.githu... oliver.jaffe@hotmail.co.uk
anthropic_solver.py
in evals/solvers/providers/anthropic
89 7 2024-03-21 2024-03-26 2 2 giulio.starace@gmail.com oliver.jaffe@hotmail.co.uk
api_utils.py
in evals/utils
15 1 2023-03-18 2024-03-26 10 8 120423412+andrew-openai@use... oliver.jaffe@hotmail.co.uk
gemini.yaml
in evals/registry/solvers
15 - 2024-03-26 2024-03-26 1 1 oliver.jaffe@hotmail.co.uk oliver.jaffe@hotmail.co.uk
actions.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
1014 50 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
processors.py
in evals/elsuite/multistep_web_tasks/webarena/browser_env
495 21 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
session.py
in evals/elsuite/multistep_web_tasks
416 28 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...
plot_experiments.py
in evals/elsuite/hr_ml_agent_bench/scripts
307 - 2024-03-21 2024-03-21 1 1 danesherbs@users.noreply.gi... danesherbs@users.noreply.gi...