Skip to content

Comments

Remove hallucinated Correctness and Complexity scorers#131

Merged
ibolmo merged 1 commit intomainfrom
fix-gh-129--hallucinated-correctness
Mar 25, 2025
Merged

Remove hallucinated Correctness and Complexity scorers#131
ibolmo merged 1 commit intomainfrom
fix-gh-129--hallucinated-correctness

Conversation

@ibolmo
Copy link
Collaborator

@ibolmo ibolmo commented Mar 25, 2025

Fixes: #129

Looks like while generating documentation/examples the LLM decided to play a trick on us and created new SpecFile based scorers but neglected to make the actual spec file.

Let's just remove them!

@ibolmo ibolmo self-assigned this Mar 25, 2025
pass


class Correctness(SpecFileClassifier):
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ibolmo ibolmo merged commit 665bbfd into main Mar 25, 2025
8 checks passed
@ibolmo ibolmo deleted the fix-gh-129--hallucinated-correctness branch March 25, 2025 22:32
@github-actions
Copy link

github-actions bot commented Mar 25, 2025

Braintrust eval report

Autoevals (main-1742941950)

Score Average Improvements Regressions
NumericDiff 75% (+0pp) - -
Start 1742941949.73s - -
End 1742941951.26s - -
Duration 1.48s (+0.06s) 27 🟢 73 🔴
Prompt_tokens 279.25tok (+0tok) - -
Completion_tokens 18.17tok (+0tok) - -
Total_tokens 297.41tok (+0tok) - -

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

correctness scorer appears to be broken due to missing template

2 participants