9.3 release of Microsoft.Extensions.AI.Evaluation libraries #6100

Merged
merged 4 commits into release/9.3 from pewaldsc/aieval-9.3
Mar 14, 2025

Member

@peterwald peterwald commented Mar 13, 2025

Note that we updated the Microsoft.Extensions.AI references to the published version. If another release happens later on this branch, we may want to update those references again.

cc @joperezr

peterwald and others added 3 commits March 13, 2025 11:40
…6078)

* Add types from AIJsonUtilities.JsonContext to reporting JsonContext

* Add a few more tests

* Chain M.E.AI.Eval serialization types through AIJsonUtilities

* Fix reversed Compact/Default

* Rename some of the types.
Also includes changes to the report to display this information. This addresses #6032.

This PR also includes numerous general improvements to the TypeScript report rendering:
* Make the metric cards clickable and display metric details (such as diagnostics and reasons) on click in a new collapsible section inline in the report (rather than in a hover tooltip). This addresses #6037.
* Show conversations in a friendlier chat bubble form and make the sections that display conversation history collapsible so that long conversations can be collapsed. This addresses #6036 partially.
* Introduce a global settings pane and move the per-textbox toggles for rendering markdown to the global location. This also addresses #6036 partially.
* In the scenario tree, collapse single children into their respective parent level to reduce the amount of clicking required to expand deep trees.
* Remove the scenario-level section for failure reasons since it is now possible to view failure reasons (and diagnostics) on a per-metric basis by clicking on metric cards.
* A bunch of other minor layout, sizing and UX improvements and fixes for the report.
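
As a rough illustration of the single-child collapsing mentioned in the list above, here is a minimal sketch. The `ScenarioNode` shape, the `collapseSingleChildren` function name, the " / " separator, and the choice to keep leaf nodes separate are all assumptions made for this example; the report's actual types and merge rules are not shown in this PR description.

```typescript
// Hypothetical node shape; not the report's real type.
interface ScenarioNode {
  name: string;
  children: ScenarioNode[];
}

// Merge each chain of single-child nodes into one node whose name joins the
// chain (assumed " / " separator), so a deep linear path expands in one click.
function collapseSingleChildren(node: ScenarioNode): ScenarioNode {
  let name = node.name;
  let current = node;
  // Assumption: only merge into a child that itself has children, so leaf
  // nodes (the scenarios themselves) remain separate entries in the tree.
  while (
    current.children.length === 1 &&
    current.children[0].children.length > 0
  ) {
    current = current.children[0];
    name = `${name} / ${current.name}`;
  }
  return { name, children: current.children.map(collapseSingleChildren) };
}
```

With this rule, a path such as `root` → `a` → `b` (where `b` has two children) renders as a single `root / a / b` level, while branching points and leaves are left untouched.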
@peterwald peterwald added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Mar 13, 2025
@peterwald peterwald self-assigned this Mar 13, 2025
@peterwald peterwald requested review from a team as code owners March 13, 2025 18:24
@peterwald
Member Author

Thanks for the approval @joperezr. Just a heads up that I am not able to merge the PR. Feel free to complete it whenever you are ready.

@joperezr joperezr merged commit bead66c into release/9.3 Mar 14, 2025
6 checks passed
@joperezr joperezr deleted the pewaldsc/aieval-9.3 branch March 14, 2025 01:18
3 participants