-
Notifications
You must be signed in to change notification settings - Fork 100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add AI Evaluation tests for eShopSupport. #49
Conversation
Testing updates and cleanup. Latest updates Update library Update evaluations. Add JSON cleanup Add different kinds of evaluation tests, defer to a eval handler Fix writing of scenario results Update to latest Evaluation library API Update Evaluation API Simplify the API Update packages to public feed Add required data dep to Tokenizers Add check for diagnostic errors Revert changes to run against AzureOpenAI More fixes Fix using
@peterwald Should we add a |
@luisquintanilla @peterwald should we also remove the I am guessing we don't want to do this yet since not all tests from the |
Since we're recommending using the evaluations in the |
There is more in the Evaluator project than what is done by the tests. I'll leave it here for now to not eliminate those options. |
Sounds good - should we log an issue to port over the remaining functionality and remove this eventually? @luisquintanilla |
Yeah. That would be great. Tracking here #50. Feel free to update / add details as needed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks @peterwald @shyamnamboodiripad
One last question before merging. Is this using the latest version of the libraries? If not, can you bump the version and then I'll merge.
Thanks
Add EvaluationTests project that demonstrates how to use Microsoft.Extensions.AI.Evaluation to run evaluations on LLM prompts and responses.