fix: new URL for AWS Bedrock and model list support by are-ces · Pull Request #4946 · llamastack/llama-stack

are-ces · 2026-02-18T11:15:52Z

What does this PR do?

The official endpoint for AWS Bedrock has changed according to AWS Documentation

This new endpoint supports the /v1/models endpoint, thus I added the method for listing models too.

Test Plan

Call Responses API with AWS Bedrock; list models

mattf · 2026-02-18T11:51:32Z

mattf

please fix the tests

are-ces · 2026-02-18T13:18:12Z

I am using Responses API in llama-stack, does this mean we need to use both endpoints?

PS: tool calling does not work on the current endpoint

are-ces · 2026-02-18T13:24:54Z

Perhaps @skamenan7 can join the conversation

mattf · 2026-02-18T15:51:25Z

I am using Responses API in llama-stack, does this mean we need to use both endpoints?

stack implements its /v1/responses using a backend provider's /v1/chat//completions. you should not have to change anything.

PS: tool calling does not work on the current endpoint

this is a good issue to file. does switching the endpoint get tool calling to work?

are-ces · 2026-02-19T07:45:53Z

Tool working does not work, you can check this notebook.
This article also mentions tool calling not supported.

With the mantle endpoint it works.

mattf · 2026-02-19T09:49:09Z

@skamenan7 what are the errors when using tools w/ the existing bedrock provider?

skamenan7 · 2026-02-25T13:29:54Z

@skamenan7 what are the errors when using tools w/ the existing bedrock provider?

Hi @mattf , with the current bedrock-runtime endpoint, sending tools in the request causes the server to return 200 OK but then hang indefinitely — no response body, no error, just an open connection until the client times out. There's no actionable error message, it silently accepts the request and never responds. I hit this consistently in local testing (both direct curl and through the stack).

ps: I away few days, hence late reply.

skamenan7

Thanks @are-ces , looks great except for few comments.

skamenan7 · 2026-02-25T13:57:27Z

src/llama_stack/providers/remote/inference/bedrock/bedrock.py

+            return await super().list_provider_model_ids()
+        except Exception as e:
+            logger.debug(f"Failed to list Bedrock models dynamically: {e}")
+            return []


I might be wrong on this, but I think returning [] on failure could cause model registry churn? ModelsRoutingTable.refresh() treats the result as authoritative, so on a transient 401/5xx it'd call update_registered_models(..., models=[]) and unregister all bedrock models.
Users'd see intermittent "model not found" until the next successful refresh.

If that's right, letting the exception propagate might be the simplest fix as refresh() already catches exceptions from list_models(). Or
narrowing the catch to only NotFoundError / APIConnectionError would also work, right?

skamenan7 · 2026-02-25T14:00:50Z

src/llama_stack/providers/remote/inference/bedrock/bedrock.py

-        return []
+        try:
+            return await super().list_provider_model_ids()
+        except Exception as e:


One thing I noticed that except Exception catches AuthenticationError too, so if someone's credentials are wrong they'd get an empty model list with a debug-level log instead of finding out right away. They wouldn't see the actual auth error until they try a chat completion.

Also the parent list_models() already has a try/except that logs at error level and re-raises, but this catch fires first and prevents that
from kicking in.

Narrowing to APIConnectionError / NotFoundError might be cleaner? That way auth failures still surface. Or bumping to warning at minimum so it's not invisible.

skamenan7 · 2026-02-25T14:06:41Z

src/llama_stack/providers/remote/inference/bedrock/bedrock.py

+            logger.debug(f"Failed to list Bedrock models dynamically: {e}")
+            return []

    async def check_model_availability(self, model: str) -> bool:


isn't the parent check_model_availability already handle the empty-cache case. The only
extra thing this override seems to do is force cache population when model_store already has the model, isn't it? is there a downstream operation that depends on that?

If not, removing the override might be simpler.

skamenan7 · 2026-02-25T14:24:28Z

nit: the new list_provider_model_ids fallback and check_model_availability override don't have unit test coverage. A couple of quick tests would help here.

mattf · 2026-02-25T16:25:40Z

@skamenan7 what are the errors when using tools w/ the existing bedrock provider?

Hi @mattf , with the current bedrock-runtime endpoint, sending tools in the request causes the server to return 200 OK but then hang indefinitely — no response body, no error, just an open connection until the client times out. There's no actionable error message, it silently accepts the request and never responds. I hit this consistently in local testing (both direct curl and through the stack).

ps: I away few days, hence late reply.

that's terrible.

i hope you have a route to file a bug against bedrock.

does moving to the new endpoint resolve this?

skamenan7 · 2026-02-26T18:35:19Z

Hi @mattf, I just tested both endpoints directly with SigV4 auth and tools param, and tool calling works on both now — bedrock-runtime returned finish_reason: "tool_calls" with the correct function call, no hang. Looks like AWS fixed it since my Oct testing. I did not get a chance to test it again since then.

But @are-ces change also has /v1/models support (old endpoint returns empty, mantle lists 30+ models) and a much bigger model catalog (DeepSeek, Mistral, Qwen, Gemma alongside GPT-OSS). The list_provider_model_ids() change makes sense with mantle since /v1/models actually returns data there.

mattf · 2026-02-27T13:18:28Z

Hi @mattf, I just tested both endpoints directly with SigV4 auth and tools param, and tool calling works on both now — bedrock-runtime returned finish_reason: "tool_calls" with the correct function call, no hang. Looks like AWS fixed it since my Oct testing. I did not get a chance to test it again since then.

that's good news.

But @are-ces change also has /v1/models support (old endpoint returns empty, mantle lists 30+ models) and a much bigger model catalog (DeepSeek, Mistral, Qwen, Gemma alongside GPT-OSS). The list_provider_model_ids() change makes sense with mantle since /v1/models actually returns data there.

i don't have an environment to test this. will you do the review, verify it works, doesn't introduce any regressions, has a good user experience, and i'll mash merge?

skamenan7 · 2026-02-27T14:38:57Z

Sure, @mattf, I will run through by these criteria and let you know.

skamenan7 · 2026-02-27T18:26:38Z

Ran through this carefully @mattf. The core change is right — mantle endpoint confirmed working for tool calling,
model listing, and streaming.

Two things worth addressing before merge:

list_provider_model_ids catches AuthenticationError in the broad except Exception block and returns
[] with a debug log. Would be better to re-raise it so users with bad credentials get a clear error
at model listing time, not a silent empty list.
The check_model_availability override looks redundant — the parent already handles the empty-cache
case. If there's a reason it needs to be there, a comment explaining the "why" would help.

If @are-ces addresses these I will check again and provide feedback

are-ces · 2026-03-02T16:08:19Z

Hi! I will address all issues asap 👍

Fix integration tests

are-ces · 2026-03-03T12:47:48Z

@skamenan7 I have addressed your point removing the overrides, they were indeed not needed.

are-ces requested review from ashwinb, bbrowning, cdoern, ehhuang, franciscojavierarceo, leseb, mattf and raghotham as code owners February 18, 2026 11:15

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 18, 2026

mattf requested changes Feb 18, 2026

View reviewed changes

are-ces force-pushed the aws-fix branch from 30d1449 to 2e6918d Compare February 18, 2026 13:12

are-ces changed the title ~~fix: wrong URL for AWS Bedrock~~ fix: new URL for AWS Bedrock and model list support Feb 18, 2026

are-ces force-pushed the aws-fix branch from c1734d2 to e2b5dd4 Compare February 18, 2026 13:51

skamenan7 suggested changes Feb 25, 2026

View reviewed changes

are-ces force-pushed the aws-fix branch from e2b5dd4 to 41b3000 Compare March 3, 2026 12:45

Fix wrong URL for AWS Bedrock and add model list

69bd927

Fix integration tests

are-ces force-pushed the aws-fix branch from 41b3000 to 69bd927 Compare March 3, 2026 12:47

Conversation

are-ces commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Uh oh!

mattf commented Feb 18, 2026

Uh oh!

mattf left a comment

Choose a reason for hiding this comment

Uh oh!

are-ces commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

are-ces commented Feb 18, 2026

Uh oh!

mattf commented Feb 18, 2026

Uh oh!

are-ces commented Feb 19, 2026

Uh oh!

mattf commented Feb 19, 2026

Uh oh!

skamenan7 commented Feb 25, 2026

Uh oh!

skamenan7 left a comment

Choose a reason for hiding this comment

Uh oh!

skamenan7 Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

skamenan7 Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

skamenan7 Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

skamenan7 commented Feb 25, 2026

Uh oh!

mattf commented Feb 25, 2026

Uh oh!

skamenan7 commented Feb 26, 2026

Uh oh!

mattf commented Feb 27, 2026

Uh oh!

skamenan7 commented Feb 27, 2026

Uh oh!

skamenan7 commented Feb 27, 2026

Uh oh!

are-ces commented Mar 2, 2026

Uh oh!

are-ces commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

are-ces commented Feb 18, 2026 •

edited

Loading

are-ces commented Feb 18, 2026 •

edited

Loading