9.3 release of Microsoft.Extensions.AI.Evaluation libraries #6100

peterwald · 2025-03-13T18:24:51Z

Note that we updated the Microsoft.Extensions.AI references to the published version. If another release ships from this branch later, we may want to update those references again.

cc @joperezr

…6078)

* Add types from AIJsonUtilities.JsonContext to reporting JsonContext
* Add a few more tests
* Chain M.E.AI.Eval serialization types through AIJsonUtilities
* Fix reversed Compact/Default
* Rename some of the types.

Also includes changes to the report to display this information. This addresses #6032.

Additionally, this PR also includes numerous general improvements to the TypeScript report rendering:

* Make the metric cards clickable and display metric details (such as diagnostics, reasons etc.) on click in a new collapsible section inline in the report (as opposed to in a hover tooltip). This addresses #6037.
* Show conversations in a friendlier chat bubble form and make the sections that display conversation history collapsible so that long conversations can be collapsed. This addresses #6036 partially.
* Introduce a global settings pane and move the per-textbox toggles for rendering markdown to the global location. This also addresses #6036 partially.
* In the scenario tree, collapse single children into their respective parent level to reduce the amount of clicking required to expand deep trees.
* Remove the scenario-level section for failure reasons since it is now possible to view failure reasons (and diagnostics) on a per-metric basis by clicking on metric cards.
* A bunch of other minor layout, sizing and UX improvements and fixes for the report.
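For readers unfamiliar with the scenario-tree change, the idea of collapsing single children into their parent level can be sketched roughly as below. This is only an illustration and not the code from this PR: the `ScenarioNode` shape, the `collapseSingleChildren` function, and the `" / "` separator are all hypothetical names chosen for the example, and the choice to leave leaf nodes unmerged is an assumption.

```typescript
// Hypothetical node shape; the report's real types are not shown in this PR.
interface ScenarioNode {
  name: string;
  children: ScenarioNode[];
}

// Merge each chain of single-child group nodes into one node whose name
// joins the names along the chain. Leaf nodes are never merged upward
// (an assumption made for this sketch).
function collapseSingleChildren(node: ScenarioNode, separator = " / "): ScenarioNode {
  let name = node.name;
  let current = node;
  // Walk down while there is exactly one child and that child is itself a group.
  while (current.children.length === 1 && current.children[0].children.length > 0) {
    current = current.children[0];
    name = `${name}${separator}${current.name}`;
  }
  return {
    name,
    children: current.children.map((child) => collapseSingleChildren(child, separator)),
  };
}

// Example: three nested levels, each with a single child, above two leaves.
const tree: ScenarioNode = {
  name: "Tests",
  children: [
    {
      name: "Chat",
      children: [
        {
          name: "Quality",
          children: [
            { name: "ScenarioA", children: [] },
            { name: "ScenarioB", children: [] },
          ],
        },
      ],
    },
  ],
};

const collapsed = collapseSingleChildren(tree);
console.log(collapsed.name); // "Tests / Chat / Quality"
```

With this shape, a user expands one combined level instead of three before reaching the two scenarios.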

...xtensions.AI.Evaluation.Reporting/CSharp/Microsoft.Extensions.AI.Evaluation.Reporting.csproj

src/Libraries/Microsoft.Extensions.AI.Evaluation/Microsoft.Extensions.AI.Evaluation.csproj

....AI.Evaluation.Integration.Tests/Microsoft.Extensions.AI.Evaluation.Integration.Tests.csproj

peterwald · 2025-03-13T21:31:44Z

Thanks for the approval @joperezr. Just a heads up that I am not able to merge the PR. Feel free to complete it whenever you are ready.

peterwald and others added 3 commits March 13, 2025 11:40

Update M.E.AI dependencies to released packages.

5658d82

peterwald added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Mar 13, 2025

peterwald self-assigned this Mar 13, 2025

peterwald requested review from a team as code owners March 13, 2025 18:24

shyamnamboodiripad reviewed Mar 13, 2025

...xtensions.AI.Evaluation.Reporting/CSharp/Microsoft.Extensions.AI.Evaluation.Reporting.csproj

joperezr reviewed Mar 13, 2025

src/Libraries/Microsoft.Extensions.AI.Evaluation/Microsoft.Extensions.AI.Evaluation.csproj

joperezr reviewed Mar 13, 2025

....AI.Evaluation.Integration.Tests/Microsoft.Extensions.AI.Evaluation.Integration.Tests.csproj

Add comment RE removing version overrides for subsequent builds.

9abbe10

shyamnamboodiripad approved these changes Mar 13, 2025

joperezr approved these changes Mar 13, 2025

joperezr merged commit bead66c into release/9.3 Mar 14, 2025
6 checks passed

joperezr deleted the pewaldsc/aieval-9.3 branch March 14, 2025 01:18
