This AI sample lacks risk & safety evaluation implementation #236

carlotta94c · 2025-01-10T16:39:59Z

As per email sent on 13th Dec with same subject, please double-check that this AI sample implements evaluations. In particular, what we are looking for is:

Evaluation file(s) (might be a Jupyter notebook, a unit test script, etc.) that evaluates the solution against quality metrics
Evaluation file(s) (might be a Jupyter notebook, a unit test script, etc.) that evaluates the solution against at least 2 safety metrics
A descriptive section in your readme explaining how evaluation is implemented into the sample.

carlotta94c · 2025-01-10T16:46:15Z

@nitya Adding this issue here so we can track progress on the evaluation stream. I know you mentioned that one blocker is that existing evaluations should be migrated to the Azure AI eval SDK before adding more metrics.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This AI sample lacks risk & safety evaluation implementation #236

This AI sample lacks risk & safety evaluation implementation #236

carlotta94c commented Jan 10, 2025

carlotta94c commented Jan 10, 2025

This AI sample lacks risk & safety evaluation implementation #236

This AI sample lacks risk & safety evaluation implementation #236

Comments

carlotta94c commented Jan 10, 2025

carlotta94c commented Jan 10, 2025