400 - {'detail': 'Invalid value: Non supported ToolPromptFormat ToolPromptFormat.json'} - with default tool_prompt_format 3.3/3.2 #695
Comments
I think that's because tool_prompt_format defaults to json in the llama stack API, but why isn't json working when that's the default prompt format? Update: For all the 3.1 models, using json as the tool_prompt_format is correct, but for 3.3 the tool_prompt_format is supposed to be python_list only. A screenshot of the code is attached below. But why aren't other formats like json supported for 3.3?
To solve this, we can just add a check in the inference APIs, saying
@cheesecake100201 yeah, that's what I'm thinking: if the user didn't specify the tool_prompt_format, we should set the correct default for them.
Thanks for filing this, @aidando73. We will need to infer the right format instead of always defaulting to json. I will work on fixing this.
…742)
# What does this PR do?
We are setting a default value of json for the tool prompt format, which conflicts with Llama 3.2/3.3 models since they use python_list. This PR changes the default to None, and the code infers the default based on the model.
Addresses: #695
Tests:
❯ LLAMA_STACK_BASE_URL=http://localhost:5000 pytest -v tests/client-sdk/inference/test_inference.py -k "test_text_chat_completion"
pytest llama_stack/providers/tests/inference/test_prompt_adapter.py
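The idea described in the PR (default to None, then pick a format from the model) could look roughly like the sketch below. This is not the actual llama-stack code: the `ToolPromptFormat` enum here is a local stand-in, `infer_tool_prompt_format` is a hypothetical helper name, and the version parsing assumes model identifiers contain something like `Llama3.3` or `Llama-3.1`. Falling back to json for unrecognized models is also an assumption.

```python
import re
from enum import Enum
from typing import Optional


class ToolPromptFormat(Enum):
    # Local stand-in for the enum named in the error message; not imported from llama-stack.
    json = "json"
    python_list = "python_list"


# Assumption taken from this thread: 3.1 models use json, 3.2/3.3 use python_list.
_PYTHON_LIST_VERSIONS = {"3.2", "3.3"}


def infer_tool_prompt_format(
    model_id: str,
    requested: Optional[ToolPromptFormat] = None,
) -> ToolPromptFormat:
    """Hypothetical helper: respect an explicit choice, otherwise infer from the model."""
    if requested is not None:
        # The caller passed a format explicitly, so do not override it.
        return requested

    # Assumed naming scheme: "llama" optionally followed by "-", then "<major>.<minor>".
    match = re.search(r"llama-?(\d\.\d)", model_id.lower())
    if match and match.group(1) in _PYTHON_LIST_VERSIONS:
        return ToolPromptFormat.python_list

    # Fallback for 3.1 and for unrecognized identifiers (an assumption of this sketch).
    return ToolPromptFormat.json


if __name__ == "__main__":
    print(infer_tool_prompt_format("Llama3.3-70B-Instruct"))
    print(infer_tool_prompt_format("Llama3.1-8B-Instruct"))
```

With a check like this, a request that omits tool_prompt_format would no longer be sent to a 3.3 model with json, which is what produced the 400 above. A user who does pass a format still gets exactly what they asked for.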
System Info
Information
🐛 Describe the bug
Using fireworks provider
This returns a 400 error
Error logs
Returns
Expected behavior
The correct tool_prompt_format is python_list. Note that I can pass it in explicitly:
But that requires understanding of the internals of llama-stack and tool prompt formats which most users won't have.