modelcontextprotocol
diff --git a/‎docs/advanced/low-level-server.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/advanced/low-level-server.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/advanced/multi-round-trip.md‎
Lines changed: 5 additions & 5 deletions b/‎docs/advanced/multi-round-trip.md‎
Lines changed: 5 additions & 5 deletions
diff --git a/‎docs/migration.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/migration.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs_src/mrtr/tutorial005.py‎
Lines changed: 6 additions & 1 deletion b/‎docs_src/mrtr/tutorial005.py‎
Lines changed: 6 additions & 1 deletion
diff --git a/‎examples/stories/mrtr/README.md‎
Lines changed: 2 additions & 1 deletion b/‎examples/stories/mrtr/README.md‎
Lines changed: 2 additions & 1 deletion
diff --git a/‎examples/stories/mrtr/server_lowlevel.py‎
Lines changed: 3 additions & 2 deletions b/‎examples/stories/mrtr/server_lowlevel.py‎
Lines changed: 3 additions & 2 deletions
diff --git a/‎src/mcp/server/auth/middleware/bearer_auth.py‎
Lines changed: 3 additions & 8 deletions b/‎src/mcp/server/auth/middleware/bearer_auth.py‎
Lines changed: 3 additions & 8 deletions
diff --git a/‎src/mcp/server/auth/provider.py‎
Lines changed: 11 additions & 0 deletions b/‎src/mcp/server/auth/provider.py‎
Lines changed: 11 additions & 0 deletions
@@ -181,7 +181,7 @@ The handshake belongs to the runner. `server/discover`, `ping`, and every other
 
 Each of these is one idea you now have the vocabulary for; each has its own chapter.
 
-* `on_call_tool`, `on_get_prompt`, and `on_read_resource` may return an `InputRequiredResult` instead of their normal result to pause the call and ask the client for input; see **[Multi-round-trip requests](multi-round-trip.md)**. True to this tier, nothing is required at construction: the `request_state` you set crosses the wire exactly as written until you opt in with `server.middleware.append(RequestStateBoundary(RequestStateSecurity(keys=[...])))`: one line (both names import from `mcp.server.request_state`) for the identical sealing and verification `MCPServer` enforces (**[Protecting `requestState`](multi-round-trip.md#protecting-requeststate)**).
+* `on_call_tool`, `on_get_prompt`, and `on_read_resource` may return an `InputRequiredResult` instead of their normal result to pause the call and ask the client for input; see **[Multi-round-trip requests](multi-round-trip.md)**. True to this tier, nothing is required at construction: the `request_state` you set crosses the wire exactly as written until you opt in with `server.middleware.append(RequestStateBoundary(RequestStateSecurity(keys=[...]), default_audience=server.name))`: one line (both names import from `mcp.server.request_state`) for the identical sealing and verification `MCPServer` enforces (**[Protecting `requestState`](multi-round-trip.md#protecting-requeststate)**).
 * `on_list_resources`, `on_read_resource`, `on_list_prompts`, `on_get_prompt`, `on_completion` are the same `(ctx, params) -> result` shape for the other primitives.
 * `server.streamable_http_app()` returns the same Starlette app `MCPServer`'s does; deploy it the way **[Running your server](../run/index.md)** deploys any other ASGI app. There is no `server.run(transport=...)` down here: `server.run(read_stream, write_stream, server.create_initialization_options())` drives one connection over a pair of streams, and that one line is the whole story.
 
 
@@ -112,9 +112,9 @@ mcp = MCPServer("dev", request_state_security=RequestStateSecurity.ephemeral())
 With either built-in configuration, `requestState` on the wire is an encrypted, authenticated token. Your code never sees it: handlers and resolvers write plaintext and read plaintext (`ctx.request_state`); the SDK seals on the way out and verifies on the way in. Beyond integrity, each token is bound to:
 
 * **A time window.** Every round re-seals with a fresh expiry, so `RequestStateSecurity(ttl=...)` (default 600 seconds) bounds per-round think time, not the whole flow.
-* **The authenticated client.** When the request carries an OAuth access token the SDK validated, the state is bound to that `client_id`: a token minted for one principal fails under another. When auth is terminated outside the SDK (a fronting proxy), or the transport is unauthenticated, there is no principal to bind and this check is inert, unless `RequestStateSecurity(bind_principal=...)` supplies one from your own identity signal.
+* **The authenticated principal.** When the request carries an OAuth access token the SDK validated, the state is bound to the token's client, issuer, and subject: state minted for one user fails under another, even when both users share one OAuth client. A verifier that supplies no subject degrades the binding to the client identity alone, which under URL-based client IDs is shared by every user of that client software. When auth is terminated outside the SDK (a fronting proxy), or the transport is unauthenticated, there is no principal to bind and this check is inert, unless `RequestStateSecurity(bind_principal=...)` supplies one from your own identity signal. Whichever components your token verifier supplies, it must supply them consistently: a verifier that includes the subject on some requests and omits it on others changes the principal mid-flow, and in-flight rounds are rejected.
 * **The originating request.** The method, the tool or prompt name (or resource URI), and a digest of the arguments. A token replayed against a different tool, different arguments, or a different method fails.
-* **The exact question asked.** A recorded resolver answer is pinned to the rendered question the client was shown. Redeploy with a reworded message or a changed schema and the server re-asks instead of reusing a stale answer. The same pinning cuts the other way: derive messages from the tool's arguments, not from per-call data. A message built from a timestamp or a live rate renders differently every round, so every recorded answer looks stale and the server re-asks until the client's round limit ends the call.
+* **The exact question asked.** Every resolver answer is pinned to the rendered question the client was shown, both on the round it first arrives and when a recorded answer is reused later. Redeploy with a reworded message or a changed schema and the server re-asks instead of consuming a stale answer. The same pinning cuts the other way: derive messages from the tool's arguments, not from per-call data. A message built from a timestamp or a live rate renders differently every round, so every recorded answer looks stale and the server re-asks until the client's round limit ends the call.
 
 All of that is the SDK's job, not yours, and not the codec's if you bring your own.
 
@@ -130,13 +130,13 @@ RequestStateSecurity(keys=[NEW])       # 3: one ttl after phase 2 is fully out,
 
 Never promote the minter first: minting under a key some instance can't yet verify drops in-flight rounds mid-rollout.
 
-Keys are scoped to one service. The sealed envelope also carries the server's name as an audience claim by default, so a token minted by a different service that happens to share a secret is rejected anyway. `RequestStateSecurity(audience=...)` overrides the claim for deliberate multi-service topologies where one service must accept state another minted.
+Keys are scoped to one service. The sealed envelope also carries the server's name as an audience claim, so a token minted by a different service that happens to share a secret is rejected anyway. The claim is only as distinctive as the name, which is why `MCPServer` refuses `request_state_security=` on an unnamed server. `RequestStateSecurity(audience=...)` overrides the claim for deliberate multi-service topologies where one service must accept state another minted.
 
 ### Bring your own crypto
 
 `RequestStateSecurity(codec=...)` takes anything with `seal(bytes) -> str` and `unseal(str) -> bytes` that raises `InvalidRequestState` for any token it did not mint. The classic shape is envelope encryption against a KMS, where you unwrap a data key once at startup and keep the per-token crypto local:
 
-```python title="server.py" hl_lines="12 29-30 33"
+```python title="server.py" hl_lines="12 26-27 34-35 38"
 --8<-- "docs_src/mrtr/tutorial005.py"
 ```
 
@@ -184,6 +184,6 @@ The low-level `Server` is the no-batteries tier: nothing is required at construc
 * To inspect or persist rounds, use `client.session.call_tool(..., allow_input_required=True)` and own the `while isinstance(result, InputRequiredResult)` loop yourself.
 * On `@mcp.tool()`, a dependency that asks the user produces this result for you (**[Dependencies](../tutorial/dependencies.md)**); the **low-level** `Server` is the manual form.
 * Prompts and resources participate too: an `@mcp.prompt()` or template `@mcp.resource()` function returns the `InputRequiredResult` itself and reads `ctx.input_responses` on the retry.
-* `requestState` comes back as client-supplied input. `MCPServer` requires a `request_state_security=` choice before it will register a `Resolve(...)` tool, and seals hand-built state with the same machinery once you configure it. The seal binds every token to a time window, the originating request, and the authenticated client when the request carries auth the SDK validated or `bind_principal=` supplies your own identity signal (**[Protecting `requestState`](#protecting-requeststate)**).
+* `requestState` comes back as client-supplied input. `MCPServer` requires a `request_state_security=` choice before it will register a `Resolve(...)` tool, and seals hand-built state with the same machinery once you configure it. The seal binds every token to a time window, the originating request, and the authenticated principal when the request carries auth the SDK validated or `bind_principal=` supplies your own identity signal (**[Protecting `requestState`](#protecting-requeststate)**).
 
 This is the mechanism that replaces server-initiated sampling and the rest of the push-style back-channel; see **[Deprecated features](deprecated.md)**.
@@ -433,7 +433,7 @@ from mcp.server.mcpserver import MCPServer, RequestStateSecurity
 mcp = MCPServer("my-server", request_state_security=RequestStateSecurity.ephemeral())
 ```
 
-Multi-instance deployments share secret keys instead (`RequestStateSecurity(keys=[...])`) so every instance can verify what a sibling minted. The choices, what gets sealed, key rotation, and custom codecs are covered in [Protecting `requestState`](advanced/multi-round-trip.md#protecting-requeststate).
+Multi-instance deployments share secret keys instead (`RequestStateSecurity(keys=[...])`) so every instance can verify what a sibling minted. A configured server must also be named (or pass `RequestStateSecurity(audience=...)`): the name becomes the sealed token's audience claim, so an unnamed server raises `ValueError` at construction. The choices, what gets sealed, key rotation, and custom codecs are covered in [Protecting `requestState`](advanced/multi-round-trip.md#protecting-requeststate).
 
 On a protected server the wire `requestState` is an opaque sealed token, and `ctx.request_state` returns the verified plaintext your handler originally wrote. Sealing and verification happen at the wire boundary, so handler code reads exactly what it minted. Hand-built `requestState` (a tool, prompt, or resource-template function returning `InputRequiredResult` itself) is unaffected unless you opt in, in which case it is sealed and verified automatically too.
 
 
@@ -23,8 +23,13 @@ def seal(self, payload: bytes) -> str:
         return PREFIX + (nonce + self._aesgcm.encrypt(nonce, payload, PREFIX.encode())).hex()
 
     def unseal(self, token: str) -> bytes:
+        if not token.startswith(PREFIX):
+            raise InvalidRequestState("unknown token format")
+        body = token[len(PREFIX) :]
         try:
-            raw = bytes.fromhex(token.removeprefix(PREFIX))
+            raw = bytes.fromhex(body)
+            if raw.hex() != body:  # only the exact string seal() produced verifies
+                raise ValueError("non-canonical hex")
             return self._aesgcm.decrypt(raw[:12], raw[12:], PREFIX.encode())
         except (ValueError, InvalidTag) as exc:
             raise InvalidRequestState("token failed verification") from exc
 
@@ -56,7 +56,8 @@ uv run python -m stories.mrtr.client --http --server server_lowlevel
   then completes the round normally.
 - `server_lowlevel.py`: the lowlevel tier has no construction-time
   requirement; the same enforcement is one appended middleware:
-  `server.middleware.append(RequestStateBoundary(RequestStateSecurity.ephemeral()))`.
+  `server.middleware.append(RequestStateBoundary(RequestStateSecurity.ephemeral(),
+  default_audience=server.name))`.
 
 ## Caveats
 
 
@@ -57,8 +57,9 @@ async def call_tool(
         return types.CallToolResult(content=[types.TextContent(text=f"deployment to {env} cancelled")])
 
     server = Server("mrtr-example", on_list_tools=list_tools, on_call_tool=call_tool)
-    # Lowlevel opt-in: append the same boundary middleware MCPServer installs from request_state_security=.
-    server.middleware.append(RequestStateBoundary(RequestStateSecurity.ephemeral()))
+    # Lowlevel opt-in: append the same boundary middleware MCPServer installs from
+    # request_state_security=; the server name becomes the token audience.
+    server.middleware.append(RequestStateBoundary(RequestStateSecurity.ephemeral(), default_audience=server.name))
     return server
 
 
 
@@ -7,7 +7,7 @@
 from starlette.requests import HTTPConnection
 from starlette.types import Receive, Scope, Send
 
-from mcp.server.auth.provider import AccessToken, TokenVerifier
+from mcp.server.auth.provider import AccessToken, TokenVerifier, principal_components
 
 
 class AuthenticatedUser(SimpleUser):
@@ -34,13 +34,8 @@ def authorization_context(user: AuthenticatedUser) -> AuthorizationContext:
     See `examples/servers/simple-auth/mcp_simple_auth/token_verifier.py` for
     a verifier that populates `subject` and `claims` from an introspection
     response."""
-    token = user.access_token
-    issuer = (token.claims or {}).get("iss")
-    return AuthorizationContext(
-        client_id=token.client_id,
-        issuer=str(issuer) if issuer is not None else None,
-        subject=token.subject,
-    )
+    client_id, issuer, subject = principal_components(user.access_token)
+    return AuthorizationContext(client_id=client_id, issuer=issuer, subject=subject)
 
 
 class BearerAuthBackend(AuthenticationBackend):
 
@@ -59,6 +59,17 @@ class AccessToken(BaseModel):
     claims: dict[str, Any] | None = None  # additional claims (e.g. `iss`, `act`)
 
 
+def principal_components(token: AccessToken) -> tuple[str, str | None, str | None]:
+    """The (client_id, issuer, subject) triple identifying the principal a token represents.
+
+    The single source for "who is this token's principal": session ownership and
+    request-state binding both build on it. Components the token verifier does
+    not supply are `None`, so comparisons degrade to the remaining components.
+    """
+    issuer = (token.claims or {}).get("iss")
+    return token.client_id, str(issuer) if issuer is not None else None, token.subject
+
+
 RegistrationErrorCode = Literal[
     "invalid_redirect_uri",
     "invalid_client_metadata",