TruLens Eval v0.17.0
Changelog:
- Add criteria and improve chain of thought prompting for evals
- Allow feedback functions to be in different directions with appropriate coloring/emojis
- Filter leaderboard feedback function results to only those available for the given app id
- Add smoke testing/benchmarking for groundedness based on SummEval dataset
Bug Fixes:
- Fix issue with LiteLLM provider
- Allow Groundedness to run with any LLM provider
Examples
- Using Anthropic Claude to run feedback functions