Skip to content

Conversation

@ibolmo
Copy link
Contributor

@ibolmo ibolmo commented Sep 9, 2025

No description provided.

@ibolmo ibolmo self-assigned this Sep 9, 2025
@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python (posens/main-1757444564)

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 1 🟢 -

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

My Evaluation (posens/main-1757444566)

Score Average Improvements Regressions
Exact match 100% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 10tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 2tok (+0tok) - -
Total_tokens 12tok (+0tok) - -
Duration 0.04s (-0.04s) 1 🟢 -

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python (posens/main-1757444557)

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (+0s) - 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@ibolmo ibolmo merged commit c0dd75b into main Sep 9, 2025
7 checks passed
@ibolmo ibolmo deleted the posens/main branch September 9, 2025 19:12
@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python (main-1757445149)

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 1 🟢 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python (main-1757445155)

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 1 🟢 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

My Evaluation (main-1757445155)

Score Average Improvements Regressions
Exact match 100% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 10tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 2tok (+0tok) - -
Total_tokens 12tok (+0tok) - -
Duration 0.14s (+0.09s) - 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python ([email protected])

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (+0s) 1 🟢 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python ([email protected])

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 1 🟢 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

My Evaluation (HEAD-1757445394)

Score Average Improvements Regressions
Exact match 100% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 10tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 2tok (+0tok) - -
Total_tokens 12tok (+0tok) - -
Duration 0.01s (-0.13s) 1 🟢 -

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python ([email protected])

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 2 🟢 -

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

My Evaluation (HEAD-1757445544)

Score Average Improvements Regressions
Exact match 100% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 10tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 2tok (+0tok) - -
Total_tokens 12tok (+0tok) - -
Duration 0.04s (+0.03s) - 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

Say Hi Bot Python ([email protected])

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 1 🟢 1 🔴

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Dec 29, 2025

Braintrust eval report

Say Hi Bot Python (nickslavin/ruby-1767044408)

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Completion_reasoning_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (0s) 2 🟢 -

@github-actions
Copy link

github-actions bot commented Dec 29, 2025

Braintrust eval report

Say Hi Bot Python (nickslavin/ruby-1767044412)

Score Average Improvements Regressions
Levenshtein 77.8% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 0tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 0tok (+0tok) - -
Completion_reasoning_tokens 0tok (+0tok) - -
Total_tokens 0tok (+0tok) - -
Duration 0s (+0s) - 1 🔴

@github-actions
Copy link

github-actions bot commented Dec 29, 2025

Braintrust eval report

My Evaluation (nickslavin/ruby-1767044415)

Score Average Improvements Regressions
Exact match 100% (+0pp) - -
Llm_calls 0 (+0) - -
Tool_calls 0 (+0) - -
Errors 0 (+0) - -
Llm_errors 0 (+0) - -
Tool_errors 0 (+0) - -
Prompt_tokens 10tok (+0tok) - -
Prompt_cached_tokens 0tok (+0tok) - -
Prompt_cache_creation_tokens 0tok (+0tok) - -
Completion_tokens 2tok (+0tok) - -
Completion_reasoning_tokens 0tok (+0tok) - -
Total_tokens 12tok (+0tok) - -
Duration 0.2s (+0.16s) - 1 🔴

@github-actions
Copy link

github-actions bot commented Dec 29, 2025

Braintrust eval report

No experiments to report

@github-actions
Copy link

github-actions bot commented Dec 29, 2025

Braintrust eval report

No experiments to report

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants