Conversation

@featzhang (Member) commented:

What is the purpose of the change

This PR introduces a new optional Triton inference module under flink-models, enabling Flink to invoke an external NVIDIA Triton Inference Server for batch-oriented model inference.

The module implements a reusable runtime-level integration based on the existing model provider SPI, allowing users to define Triton-backed models via CREATE MODEL and execute inference through ML_PREDICT without modifying the Flink planner or SQL execution semantics.
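
As a sketch of the intended usage (the provider identifier and option keys below are assumptions for illustration, not the module's confirmed configuration):

```sql
-- Hypothetical example: declare a model served by an external Triton
-- server. The 'provider' value and the 'endpoint' option key are
-- illustrative assumptions, not confirmed option names.
CREATE MODEL sentiment_model
INPUT (user_review STRING)
OUTPUT (sentiment STRING)
WITH (
  'provider' = 'triton',
  'endpoint' = 'http://localhost:8000'
);
```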


Brief change log

  • Added a new flink-model-triton module under flink-models
  • Implemented a Triton model provider based on the existing model inference framework
  • Supported asynchronous and batched inference via HTTP/REST API
  • Added documentation for Triton model usage and configuration
  • Extended SQL documentation to list Triton as a supported model provider

Verifying this change

  • Verified module compilation and packaging
  • Added unit tests for the Triton model provider factory
  • Manually validated model invocation logic against a local Triton server

Does this pull request potentially affect one of the following parts?

  • API changes: No
  • Planner changes: No
  • Runtime changes: No
  • SQL semantics changes: No

Documentation

  • Added dedicated documentation under docs/connectors/models/triton.md
  • Updated SQL model inference documentation to include Triton as a supported provider

Related issues

@flinkbot (Collaborator) commented Jan 5, 2026:

CI report:

Bot commands: the @flinkbot bot supports the following commands:
  • @flinkbot run azure: re-run the last Azure build


# Triton

The Triton Model Function allows Flink SQL to call [NVIDIA Triton Inference Server](https://github.com/triton-inference-server/server) for real-time model inference tasks.
A reviewer (Contributor) commented:

I have added some comments. Can you ask on the dev list whether this requires a FLIP, please? To me it seems big enough to warrant a FLIP.

@github-actions bot added the community-reviewed label (PR has been reviewed by the community) on Jan 5, 2026.
```sql
CREATE TEMPORARY VIEW movie_reviews(id, movie_name, user_review, actual_sentiment)
AS VALUES
  (1, 'Great Movie', 'This movie was absolutely fantastic! Great acting and storyline.', 'positive');
  -- additional sample rows elided in this excerpt
```
A reviewer (Contributor) commented:

nit: I wonder whether -1, 0 and +1 would be more intuitive values.
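
The documentation example would then query this view through ML_PREDICT; a minimal sketch, assuming a Triton-backed model named sentiment_model with a single STRING output column called sentiment has already been declared:

```sql
-- Sketch: run batch inference over the sample view. The model name
-- and output column are assumptions for illustration.
SELECT id, movie_name, sentiment, actual_sentiment
FROM ML_PREDICT(
  TABLE movie_reviews,
  MODEL sentiment_model,
  DESCRIPTOR(user_review)
);
```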


Here's an example `config.pbtxt` for a text classification model (the tensor names, data types, and dimensions below are illustrative assumptions; adapt them to your model):

```protobuf
# Illustrative config; adapt tensor names and types to your model.
name: "text-classification"
backend: "python"
max_batch_size: 8

input [
  {
    name: "INPUT_TEXT"
    data_type: TYPE_STRING
    dims: [ 1 ]
  }
]

output [
  {
    name: "OUTPUT_LABEL"
    data_type: TYPE_STRING
    dims: [ 1 ]
  }
]
```
@davidradl (Contributor) commented Jan 7, 2026:

I suggest we explicitly say that this should be in the text-classification/ folder.

├── text-classification/
│   ├── config.pbtxt
│   └── 1/
│       └── model.py   # or model.onnx, model.plan, etc.
A reviewer (Contributor) commented:

In the following example, what file do we use for model.py?

@featzhang (Member, Author) replied:

Good question — this refers to the Triton Python backend model file.

In this example, model.py is the Python backend implementation located in the Triton model repository, specifically under:

text-classification/
├── config.pbtxt
└── 1/
    └── model.py

The exact contents of model.py are not relevant to Flink itself. Flink interacts with the model only via the Triton HTTP/gRPC inference API, and does not load or execute the model code directly.

To avoid ambiguity, I will update the documentation to explicitly state that this file resides in the text-classification/ model directory.

@featzhang changed the title from "[FLINK-38857][Model] Introduce a Triton inference module under flink-models for batch-oriented AI inference" to "[FLINK-38857][Model] Introduce a Triton inference module under flink-models" on Jan 18, 2026.