Pull requests: petergpt/bullshit-benchmark

Pull requests list

Add x-ai/grok-4.3 v2 benchmark results (xhigh)
#25 opened May 1, 2026 by patelnav
15 tasks done
Add primary-metric sort toggle to the main view
#22 opened Apr 22, 2026 by peterkirgis
Add MiniMax as direct LLM provider (M2.7 + M2.7-highspeed)
#16 opened Mar 30, 2026 by octo-patch
1 of 3 tasks
Add nonsensical question related to Waymo and PHP
#1 opened Feb 25, 2026 by drewhamlett