Skip to content

Conversation

@micheleRP
Copy link
Contributor

@micheleRP micheleRP commented Jan 6, 2026

Description

This pull request introduces the AI Gateway in Redpanda Cloud.

  • Added a detailed quickstart guide for the AI Gateway, covering setup, configuration, routing, integration with popular tools (such as OpenAI and Anthropic SDKs, Claude Code, VS Code extensions, and Cursor IDE), and advanced features like CEL-based routing and deferred tool loading for token savings.

Resolves https://redpandadata.atlassian.net/browse/DOC-1867
Review deadline: GA = Feb 6

Page previews

AI Gateway Quickstart

Checks

  • New feature
  • Content gap
  • Support Follow-up
  • Small fix (typos, links, copyedits, etc)

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Jan 6, 2026

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


Comment @coderabbitai help to get the list of available commands and usage tips.

@netlify
Copy link

netlify bot commented Jan 6, 2026

Deploy Preview for rp-cloud ready!

Name Link
🔨 Latest commit 07628a0
🔍 Latest deploy log https://app.netlify.com/projects/rp-cloud/deploys/69699aef74e0dd00081e849a
😎 Deploy Preview https://deploy-preview-477--rp-cloud.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.


The orchestrator is a built-in MCP server that enables programmatic tool calling. The agent can generate JavaScript to call multiple tools in a single orchestrated step, which reduces the number of round trips. For example, a workflow requiring 47 file reads can be reduced from 49 round trips to just 1.

*REVIEWERS: When/how exactly do you use the orchestrator? Also what happens after they create a gateway? Please provide an example of how to validate end-to-end routing against the gateway endpoint!*
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it is an MCP server exposing tools, so it is used as another regular MCP server. I can provide examples.


This can reduce token usage significantly (for example, 80-90% depending on how many servers/tools are configured).

== Observability
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I haven't gotten to the observability implementation yet, probably next week.


This is central to governance: You can see and control usage by gateway boundary (for example, by team, environment, customer, or product).

*REVIEWERS: Where do those metrics appear in the UI, or how does a user validate observability after setup?*
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is not implemented yet.

@micheleRP micheleRP force-pushed the DOC-1867-Document-feature-AI-Gateway-help-cloud-team-polish-clean-up branch from 9aef822 to b28bea7 Compare January 6, 2026 20:32
----
{
"claude.mcpServers": {
"redpanda-ai-gateway": {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This configuration should use http as transport. For example, to configure our internal MCP gateway in Claude Code, it would look roughly like this:

claude mcp add --transport http redpanda-aigateway https://gw.ai.panda.dev/mcp --header "Authorization: Bearer YOUR_API_KEY" --header "rp-aigw-id: GATEWAY_ID"

Copy link
Member

@c4milo c4milo Jan 6, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is also a way to disallow people from using other MCP servers. I'm going to have to dedicate some time to documenting and testing all the integrations with popular AI coding agents.

micheleRP and others added 12 commits January 14, 2026 16:58
# Conflicts:
#	modules/ROOT/nav.adoc
Added 7 new documentation files for AI Gateway:
- what-is-ai-gateway.adoc: Overview, problem/solution framing, common patterns
- quickstart-enhanced.adoc: Step-by-step quickstart with time markers
- observability-logs.adoc: Request logs, filtering, and debugging
- observability-metrics.adoc: Dashboards, analytics, and cost tracking
- migration-guide.adoc: Safe migration from direct provider integration
- cel-routing-cookbook.adoc: CEL routing patterns with examples
- mcp-aggregation-guide.adoc: MCP aggregation and orchestration

All files follow Redpanda documentation standards:
- Sentence case headings
- Imperative verbs for action headings
- AsciiDoc format
- Comprehensive placeholders for product-specific details

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Add personas, learning objectives, and prerequisites to all AI Gateway
documentation pages. Remove DRAFT prefixes from titles and time estimates
from quickstart. Fix passive voice in multiple locations.

Changes:
- Add page-personas attributes to all 7 files
- Add learning objectives in ABCD format
- Add prerequisites sections where missing
- Remove "DRAFT:" from all page titles
- Remove time estimates from quickstart-enhanced.adoc
- Fix passive voice constructions
- Improve page descriptions
- Preserve all placeholder comments for future content

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
@micheleRP micheleRP force-pushed the DOC-1867-Document-feature-AI-Gateway-help-cloud-team-polish-clean-up branch from 1297ef0 to 49a573f Compare January 15, 2026 00:01
micheleRP and others added 2 commits January 14, 2026 19:10
Provide admin and user guides for configuring Claude Code, Cline, Continue.dev, Cursor IDE, and GitHub Copilot to work with AI Gateway, enabling centralized LLM routing and MCP tool aggregation.

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants