-
Notifications
You must be signed in to change notification settings - Fork 29
Pull requests: evaleval/every_eval_ever
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add AIR-Bench leaderboard to HELM adapter
#108
opened Apr 15, 2026 by
yifanmai
Collaborator
Loading…
Tests for Inspect adapter edge cases (empty choices, local paths, misleading dataset names)
#105
opened Apr 14, 2026 by
MattFisher
Loading…
Adapter for SWE-Bench Verified leaderboard evaluation results
#104
opened Apr 12, 2026 by
jatinganhotra
Loading…
test: expand coverage for duplicate entry checks
#103
opened Apr 6, 2026 by
gbemike
Contributor
Loading…
ci: add GitHub Actions workflow to run tests on PRs
#99
opened Apr 6, 2026 by
mrshu
Contributor
Loading…
Draft Proposal: Agent Session Result Layer
#70
opened Mar 17, 2026 by
elronbandel
Contributor
•
Draft
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.