Client-side Chat History Maintenance #893

tomfrenken · 2025-07-29T09:23:29Z

Chat History Maintenance

Status

Accepted

Context

With the introduction of chat history maintenance, we must distinguish between client-side and server-side approaches.
This ADR primarily focuses on client-side history maintenance, but also outlines the alternative server-side approach for context.

Server-Side History Maintenance

In server-side maintenance, the server is responsible for storing the chat history.

A prominent example is OpenAI’s conversation state, where conversations can branch off from any given response_id, continuing from that point in history.

Ideally, this would be implemented by the orchestration service, which could:

Unify APIs across different vendors
Enable history support for vendors that do not yet offer it natively

Benefits:

Reduced ingress traffic: Only the latest user message needs to be sent, not the full conversation history.
Optimization potential: History could be pre-vectorized, enabling efficient input by attaching only the relevant context.

Client-Side History Maintenance

In this model, the client becomes stateful, tracking the ongoing conversation locally.

This approach introduces several responsibilities:

History initialization
Tenant isolation
History retrieval
Streaming considerations

History Initialization

Client-side history can be initialized in two ways:

New conversation: Starts with an empty history
Existing conversation: Initializes with a preloaded history

The client must support both scenarios.
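A minimal sketch of the two initialization paths, assuming a hypothetical `ChatClient` class and `ChatMessage` shape (neither is defined in this ADR). History is accepted only through the constructor:

```typescript
// Hypothetical message shape; the ADR does not define one.
interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

// Hypothetical client; the class name and constructor signature are assumptions.
class ChatClient {
  private readonly history: ChatMessage[];

  // Omitting the argument starts a new conversation with an empty history.
  constructor(initialHistory: readonly ChatMessage[] = []) {
    // Copy the array so later changes to the caller's array do not leak in.
    this.history = [...initialHistory];
  }

  getChatHistory(): ChatMessage[] {
    return [...this.history];
  }
}

// New conversation: empty history.
const fresh = new ChatClient();

// Existing conversation: preloaded history, e.g. read from a database.
const resumed = new ChatClient([
  { role: 'user', content: 'Hello' },
  { role: 'assistant', content: 'Hi, how can I help?' }
]);
```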

Tenant Isolation

To ensure multi-tenant safety (similar to Cloud SDK destination isolation), chat histories must be isolated per tenant.

Options include:

User-managed: Users instantiate a new client per tenant
Client-managed: Built-in isolation via a token or tenant identifier
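The user-managed option could look roughly like the sketch below, where the application keeps one client per tenant. `ChatClient`, `record`, and `getClientForTenant` are hypothetical names used for illustration only:

```typescript
// Hypothetical message shape; the ADR does not define one.
interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

// Hypothetical client that only tracks its own history.
class ChatClient {
  private readonly history: ChatMessage[] = [];

  // Hypothetical helper to append a message to this client's history.
  record(message: ChatMessage): void {
    this.history.push(message);
  }

  getChatHistory(): ChatMessage[] {
    return [...this.history];
  }
}

// Application-side registry: the application, not the client, owns isolation.
const clientsByTenant = new Map<string, ChatClient>();

function getClientForTenant(tenantId: string): ChatClient {
  let client = clientsByTenant.get(tenantId);
  if (client === undefined) {
    client = new ChatClient();
    clientsByTenant.set(tenantId, client);
  }
  return client;
}

// Messages recorded for one tenant never appear in another tenant's history.
getClientForTenant('tenant-a').record({ role: 'user', content: 'Hello from A' });
```

Because each tenant gets its own instance, no tenant identifier has to be threaded through the client itself.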

History Retrieval

Since production environments often favor stateless services, history may be offloaded to external databases (e.g., Redis, NoSQL) for durability and scalability.

To support this, the client should expose APIs that allow:

Full history export
Incremental updates (for batch or real-time sync)

client.getChatHistory(); // Returns all messages
client.getChatHistoryIncrement(); // Returns messages since the last call
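One way to back these two methods is a cursor that remembers how many messages the last incremental call returned. This is only a sketch: the `ChatClient` class, the `record` helper, and the `ChatMessage` shape are assumptions, and only the two method names come from the text above.

```typescript
// Hypothetical message shape; the ADR does not define one.
interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

class ChatClient {
  private readonly history: ChatMessage[] = [];
  // Number of messages already handed out by getChatHistoryIncrement().
  private exportedCount = 0;

  // Hypothetical helper to append a message to the history.
  record(message: ChatMessage): void {
    this.history.push(message);
  }

  // Returns a copy of all messages.
  getChatHistory(): ChatMessage[] {
    return [...this.history];
  }

  // Returns messages added since the last call and advances the cursor.
  getChatHistoryIncrement(): ChatMessage[] {
    const increment = this.history.slice(this.exportedCount);
    this.exportedCount = this.history.length;
    return increment;
  }
}

const client = new ChatClient();
client.record({ role: 'user', content: 'Hello' });
client.record({ role: 'assistant', content: 'Hi!' });
const firstBatch = client.getChatHistoryIncrement(); // both messages
const secondBatch = client.getChatHistoryIncrement(); // nothing new yet
```

A caller syncing to an external store would persist each non-empty increment. Note that this sketch keeps a single cursor per client, so it supports only one sync consumer.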

Streaming Considerations

When handling multiple concurrent streams, conversation isolation becomes critical.

Two design choices emerge:

Stream Lock
Prevent multiple simultaneous streams per client.
- Pros: Simple to implement
- Cons: Limits concurrency
- Risk: If not locked, stream chunks could bleed into a shared history

Conversation IDs
Create isolated “conversations” client-side, allowing concurrent streams.
- Pros: Scalable and concurrent
- Cons: Requires the introduction of a conversation ID concept (not currently supported by vendors)
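The stream-lock option can be sketched as a boolean flag that is held for the lifetime of a stream. Everything here is an assumption made for illustration: the `ChatClient` class, the `stream` signature, and `fakeVendorStream`, which stands in for a real vendor response stream.

```typescript
// Hypothetical message shape; the ADR does not define one.
interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

class ChatClient {
  private readonly history: ChatMessage[] = [];
  private streaming = false;

  getChatHistory(): ChatMessage[] {
    return [...this.history];
  }

  // `source` stands in for the vendor's chunk stream.
  // The lock is taken when iteration starts, not when stream() is called.
  async *stream(userMessage: string, source: AsyncIterable<string>): AsyncGenerator<string> {
    if (this.streaming) {
      throw new Error('A stream is already in progress for this client.');
    }
    this.streaming = true;
    let reply = '';
    try {
      for await (const chunk of source) {
        reply += chunk;
        yield chunk;
      }
      // Commit both messages only after the stream completed successfully.
      this.history.push({ role: 'user', content: userMessage });
      this.history.push({ role: 'assistant', content: reply });
    } finally {
      // Released on completion, on error, and on early exit from for-await.
      this.streaming = false;
    }
  }
}

// Stand-in for a vendor stream, used only to exercise the sketch.
async function* fakeVendorStream(parts: string[]): AsyncGenerator<string> {
  for (const part of parts) {
    yield part;
  }
}
```

With this design, a second stream started while the first is still being consumed is rejected instead of interleaving its chunks into the same history.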

Proposal:

We instantiate a new client for every conversation; this way, we can simplify multiple concerns:

We leave tenant-isolation concerns to our users
We don't need to handle internal conversation IDs; a simple stream lock is sufficient
Conversations are only created at client instantiation, where we can pass history as an instantiation parameter

Decision

Proposal accepted as-is

deekshas8 · 2025-07-30T12:19:38Z

Maintainer

a new client for every conversation

I like the proposal.

KavithaSiva · 2025-07-31T08:53:17Z

Maintainer

I have some follow-up questions about the proposal:

We leave tenant-isolation concerns to our users

The concept of a tenant is a bit confusing to me here. Are you referring to the tenants within SAP AI Core or to BTP tenants? If it's the former, isn't that already handled by the resourceGroup parameter during client initialisation?

We instantiate a new client for every conversation

Is this a soft requirement, or will we enforce it in code?

1 reply

tomfrenken Jul 31, 2025
Author

As discussed in the daily:
A tenant in this context can be any user within the target application. AI Core "tenants" are not suitable for isolation. BTP tenants, or an application's own tenant concept, can be considered here. Regardless, we decided not to introduce any isolation and to leave it to the user.

The new client is a hard requirement, enforced by not allowing history to be passed outside of instantiation.

tomfrenken · 2025-07-31T10:44:01Z

Author

After our discussions I consider this closed and accepted.
