Actions: braintrustdata/autoevals

Run pnpm evals

260 workflow runs

Bump to gpt5 models
Run pnpm evals #352: Commit 2a023f7 pushed by Qard
56s gpt5
Bump to gpt5 models
Run pnpm evals #351: Commit 37bbe98 pushed by Qard
1m 0s gpt5
Bump to gpt5 models
Run pnpm evals #350: Commit 11d9f77 pushed by Qard
4m 48s gpt5
Bump to gpt5 models
Run pnpm evals #349: Commit 3f0c510 pushed by Qard
1m 2s gpt5
bump
Run pnpm evals #346: Commit 310c5cb pushed by ankrgyl
36s threads
bump
Run pnpm evals #345: Commit f5496d0 pushed by ankrgyl
33s threads
simplify heuristic
Run pnpm evals #344: Commit 5e2b55f pushed by ankrgyl
35s threads
consolidate
Run pnpm evals #343: Commit 782027e pushed by ankrgyl
33s threads
add some tests
Run pnpm evals #342: Commit 468c155 pushed by ankrgyl
41s threads