Add benchmark results for x-ai/grok-4.1-fast #172

github-actions · 2025-11-20T00:32:00Z

This PR adds benchmark results for the x-ai/grok-4.1-fast model.

The following files have been updated:

src/benchmark/results.json - Raw benchmark results
src/benchmark/validation-results.json - Validation results against human baseline

This PR was automatically generated by the benchmark workflow.

Note: If you don't want to merge this PR, close it and the model will be added to the untested list to prevent re-processing.

@alrocar

Note

Adds x‑ai/grok-4.1-fast to the benchmark config and records its raw benchmark and validation outputs, which show no matches against the human baseline.

Benchmarks:
- Add raw results for x-ai/grok-4.1-fast in src/benchmark/results.json (queries, attempts, metrics, errors).
Validation:
- Add validation entries for x-ai/grok-4.1-fast in src/benchmark/validation-results.json; all cases show no matches (0 exact/numeric) with corresponding SQL recorded; aggregate model stats appended.
Config:
- Append grok-4.1-fast to x-ai models in src/benchmark-config.json.

^{Written by Cursor Bugbot for commit 70a21d0. This will update automatically on new commits. Configure here.}

vercel · 2025-11-20T00:32:05Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Preview	Comments	Updated (UTC)
llm-benchmark	Ready	Preview	Comment	Nov 20, 2025 0:33am

feat: add benchmark results for x-ai/grok-4.1-fast

70a21d0

vercel bot deployed to Preview November 20, 2025 00:33 View deployment

kmk142789 approved these changes Nov 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add benchmark results for x-ai/grok-4.1-fast #172

Add benchmark results for x-ai/grok-4.1-fast #172

Uh oh!

github-actions bot commented Nov 20, 2025 •

edited by cursor bot

Loading

Uh oh!

vercel bot commented Nov 20, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add benchmark results for x-ai/grok-4.1-fast #172

Are you sure you want to change the base?

Add benchmark results for x-ai/grok-4.1-fast #172

Uh oh!

Conversation

github-actions bot commented Nov 20, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel bot commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Nov 20, 2025 •

edited by cursor bot

Loading

vercel bot commented Nov 20, 2025 •

edited

Loading