Introduce Content Safety evaluators #6223

Open
wants to merge 4 commits into
base: main

shyamnamboodiripad
Contributor

@shyamnamboodiripad shyamnamboodiripad commented Apr 1, 2025

The new evaluators will ship in a new Microsoft.Extensions.AI.Evaluation.Safety package.

Also includes the following public API changes:

  • Add a Metadata dictionary on EvaluationMetric.
  • Make EvaluationMetric.Diagnostics nullable.
  • Convert instance functions on some (fully mutable) result types to extension methods in the same namespace.
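As a rough illustration of the shape of these three changes, here is a minimal sketch. It does not use the actual library types: `SketchMetric`, `SketchMetricExtensions`, `AddDiagnostic`, and `AddOrUpdateMetadata` are hypothetical stand-ins invented for this example, and the real signatures in Microsoft.Extensions.AI.Evaluation may differ.

```csharp
#nullable enable
using System;
using System.Collections.Generic;

// Hypothetical stand-in for a fully mutable result type.
// This is NOT the library's EvaluationMetric; names and shapes are assumptions.
public sealed class SketchMetric
{
    public SketchMetric(string name) => Name = name;

    public string Name { get; }

    // Nullable: no list is allocated until a diagnostic is actually recorded.
    public IList<string>? Diagnostics { get; set; }

    // Free-form key/value metadata attached to the metric.
    public IDictionary<string, string>? Metadata { get; set; }
}

// Helpers that could have been instance methods are written as extension
// methods in the same namespace, so the type itself stays a plain data holder.
public static class SketchMetricExtensions
{
    public static void AddDiagnostic(this SketchMetric metric, string message)
    {
        metric.Diagnostics ??= new List<string>();
        metric.Diagnostics.Add(message);
    }

    public static void AddOrUpdateMetadata(this SketchMetric metric, string key, string value)
    {
        metric.Metadata ??= new Dictionary<string, string>();
        metric.Metadata[key] = value;
    }
}

public static class Program
{
    public static void Main()
    {
        var metric = new SketchMetric("ExampleMetric");

        // Both collections start out null.
        Console.WriteLine(metric.Diagnostics is null);

        // Call sites look the same as with instance methods.
        metric.AddDiagnostic("Example diagnostic message");
        metric.AddOrUpdateMetadata("example-key", "example-value");

        Console.WriteLine(metric.Diagnostics?.Count);
        Console.WriteLine(metric.Metadata?["example-key"]);
    }
}
```

Because extension methods are resolved from the namespace, callers that already import it should not need source changes when an instance method moves to an extension, although the change is still binary-breaking for compiled consumers.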

Some more work remains before this can be merged:

  • Add a couple more evaluators that are currently missing
  • Add more documentation comments for the new evaluators and metrics
  • Display metadata for individual metrics in the report
  • Do some miscellaneous cleanup
  • Add a few more tests

Fixes #5937

@shyamnamboodiripad shyamnamboodiripad requested review from a team as code owners April 1, 2025 11:03
@github-actions github-actions bot added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Apr 1, 2025
Member

@peterwald peterwald left a comment

:shipit:

@shyamnamboodiripad shyamnamboodiripad force-pushed the rai branch 2 times, most recently from 058b183 to 1fde648 Compare April 1, 2025 20:54
@shyamnamboodiripad shyamnamboodiripad force-pushed the rai branch 2 times, most recently from 80d4761 to ca73327 Compare April 2, 2025 10:01
@shyamnamboodiripad shyamnamboodiripad force-pushed the rai branch 2 times, most recently from 5eb798a to 40fb1c6 Compare April 3, 2025 14:06
Development

Successfully merging this pull request may close these issues.

[AI Evaluation] Introduce RAI evaluators
3 participants