huggingface / evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
GitHub Repo 
0
lines of main code
0 files
0
lines of test code
0 files
2K
lines of other code
39 files
<1y
age
265 days
100%
main code touched
1 year (0 LOC)
100%
new main code
1 year (0 LOC)

1

69

1

14

2025 2024

generated by sokrates.dev (configuration) on 2025-06-30