Skip to content

autoevals published at 0.0.130#172

Closed
cpinn wants to merge 1 commit intomainfrom
caitlin/autoevals-issue
Closed

autoevals published at 0.0.130#172
cpinn wants to merge 1 commit intomainfrom
caitlin/autoevals-issue

Conversation

@cpinn
Copy link
Contributor

@cpinn cpinn commented Feb 13, 2026

Autoevals version was accidentally reset in this pr: #154

Autoevals currently published to 0.0.130 version: https://pypi.org/project/autoevals/0.0.130/

@github-actions
Copy link

github-actions bot commented Feb 13, 2026

Braintrust eval report

Autoevals (caitlin/autoevals-issue-1771002032)

Score Average Improvements Regressions
NumericDiff 74.8% (+1pp) 3 🟢 -
Time_to_first_token 1.5tok (+0.08tok) 31 🟢 88 🔴
Llm_calls 1.55 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 279.25tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 18.6tok (+0tok) - -
Completion_reasoning_tokens 0tok (+0tok) - -
Total_tokens 297.85tok (+0tok) - -
Estimated_cost 0$ (+0$) - -
Duration 4.16s (+0.66s) 43 🟢 176 🔴
Llm_duration 3.06s (+0.16s) 25 🟢 94 🔴

@cpinn
Copy link
Contributor Author

cpinn commented Feb 13, 2026

apparently python was just not published

@cpinn cpinn closed this Feb 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant