-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
.Net: feature/llm openapi payload #9741
base: main
Are you sure you want to change the base?
Conversation
baywet
commented
Nov 18, 2024
- feat: adds basic infrastructure for proxy operation runner
- feat: adds chat client to proxy
- feat: adds chat client to the OpenApi extensions methods
Signed-off-by: Vincent Biret <[email protected]>
Signed-off-by: Vincent Biret <[email protected]>
Signed-off-by: Vincent Biret <[email protected]>
Signed-off-by: Vincent Biret <[email protected]>
Before starting to consider a dedicated runner that would use an AI model to create a payload dynamically, I would suggest trying/considering one of the "cheap" options available in SK today:
|
Thanks for the input. |
The other challenge I can see here with transformations is they run once during the setup of the function, whereas the payload generation needs to run once per function call. |
Thank you for the additional information. And according to you, why is a proxy implementation for the runner more "costly"? |
Because a) SK can do it today out of the box for the majority of scenarios (see option 1 in the first comment), and if further customization is needed, it's supported as well (see option 2 ), and b) the new AI-powered runner will require maintenance and will eventually need to be made public and configurable, either fully or partially, to accommodate various customization requests from customers. This, in turn, will increase SK's public API surface and decrease development speed because we don't want to introduce breaking changes to the public API surface. Therefore, the current preference is to leverage the existing options until there are strong signals in favor of the runner. UPD: I would also suggest creating an ADR at https://github.com/microsoft/semantic-kernel/tree/main/docs/decisions that outlines the new proposed design, highlights its pros and cons, and compares it to the suggested and available today options. |