Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
58 commits
Select commit Hold shift + click to select a range
96a80ed
Add RF-DETR Triton postproc flag
aseembits93 Jun 2, 2026
83b6f69
Add batched instance RLE alignment helper
aseembits93 Jun 2, 2026
6f9d7ca
Add RF-DETR Triton sparse RLE postprocess
aseembits93 Jun 2, 2026
2ddba5e
Wire RF-DETR RLE postprocess fast path
aseembits93 Jun 2, 2026
0da43d0
Add RF-DETR Triton postproc tests
aseembits93 Jun 2, 2026
036d593
Keep RF-DETR default RLE path unchanged
aseembits93 Jun 2, 2026
54d125e
Remove RF-DETR dense batch RLE postproc path
aseembits93 Jun 2, 2026
43ee7e6
Remove legacy RF-DETR Triton postproc env flag
aseembits93 Jun 2, 2026
2a4bf73
Cover RF-DETR Triton top-k postproc retry
aseembits93 Jun 2, 2026
0869d78
Keep RF-DETR RLE reference postprocess names
aseembits93 Jun 2, 2026
6d3ccb8
Document RF-DETR Triton postprocess internals
aseembits93 Jun 2, 2026
8b74db8
Remove dead RF-DETR top-k query metadata path
aseembits93 Jun 2, 2026
ff3e2fb
Document RF-DETR Triton kernel contracts
aseembits93 Jun 2, 2026
85a3215
Restore RF-DETR postproc benchmark harnesses
aseembits93 Jun 3, 2026
60dea68
Warn on unsupported RF-DETR Triton postprocess path
aseembits93 Jun 3, 2026
e64a878
Force sparse RF-DETR masks in postproc benchmark
aseembits93 Jun 3, 2026
0558da5
perf(rfdetr-seg): fused Triton preproc kernel for TRT path
aseembits93 May 9, 2026
1b8d6c1
perf(rfdetr): add kill switch for Triton preproc fast path
aseembits93 May 11, 2026
f959d81
Optimize RF-DETR Triton preprocessing
aseembits93 May 29, 2026
be8e0ff
Consolidate RF-DETR Triton preproc kernel
aseembits93 Jun 2, 2026
488a703
Align Triton preproc env integration
aseembits93 Jun 2, 2026
6441d8e
Tighten RF-DETR Triton preprocessing runtime
aseembits93 Jun 2, 2026
c8d59a7
Tighten RF-DETR Triton preproc review coverage
aseembits93 Jun 3, 2026
a923560
Add RF-DETR Triton preproc integration coverage
aseembits93 Jun 3, 2026
6bfbbaa
Clarify RF-DETR Triton preproc runtime
aseembits93 Jun 3, 2026
ab811f2
Tighten RF-DETR Triton preproc review coverage
aseembits93 Jun 3, 2026
e3d8543
roboflow/inference-server-experimental for building images from feat/…
grzegorz-roboflow Jun 3, 2026
d0e13c0
fix batch processing (#2411)
digaobarbosa Jun 4, 2026
b54110a
Nk/add volume support (#2413)
nkuneman Jun 4, 2026
1e3d5a9
Add roboflow_core/current_time@v1 Workflow block (#2410)
patricknihranz Jun 4, 2026
347785e
Update depenencies to fix main (#2415)
PawelPeczek-Roboflow Jun 4, 2026
d82595f
Gate RF-DETR fast preprocess by pipeline depth
aseembits93 Jun 4, 2026
d6d131c
Gate RF-DETR preprocess by preproc flag
aseembits93 Jun 4, 2026
80fe467
Support RF-DETR Keypoints model (#2401)
sergii-bond Jun 4, 2026
cf7726c
Add sparse RF-DETR RLE polygon conversion
aseembits93 May 30, 2026
d188bcf
Improve RF-DETR stream pipeline scheduling
aseembits93 Jun 2, 2026
120d302
Restore RF-DETR pipeline submit ordering
aseembits93 Jun 2, 2026
2d5a9a5
Add bitpacked RF-DETR polygon helper
aseembits93 Jun 2, 2026
a86a224
Prepare RF-DETR pipeline integration for review
aseembits93 Jun 2, 2026
5ccefce
Keep RF-DETR postprocess deferral in pipeline stack
aseembits93 Jun 2, 2026
504e1c6
Keep RF-DETR uncompressed RLE counts in pipeline stack
aseembits93 Jun 2, 2026
b0a340c
Route RF-DETR deferred postprocess only through Triton
aseembits93 Jun 2, 2026
5d8f659
Document RF-DETR deferred postprocess helpers
aseembits93 Jun 2, 2026
650526e
Restore deferred RF-DETR query metadata path
aseembits93 Jun 2, 2026
942e34a
Document RF-DETR deferred kernel metadata contract
aseembits93 Jun 2, 2026
938a794
Remove RF-DETR pipeline profiling hooks
aseembits93 Jun 3, 2026
ae6c5b6
Tighten RF-DETR pipeline review coverage
aseembits93 Jun 3, 2026
34b6022
Clarify RF-DETR pipeline async handoff
aseembits93 Jun 3, 2026
dd8282c
Tighten RF-DETR pipeline review readiness
aseembits93 Jun 3, 2026
33af52d
Gate RF-DETR stream pipeline by model capability
aseembits93 Jun 3, 2026
4b1645f
Use existing Roboflow frame handler
aseembits93 Jun 3, 2026
a317163
Isolate pipelined workflow runner
aseembits93 Jun 3, 2026
5d799bf
Fix late stream pipeline activation buffering
aseembits93 Jun 3, 2026
30bd49f
Clarify RF-DETR TRT async stream handoff
aseembits93 Jun 3, 2026
9582947
Gate RF-DETR opt paths behind disabled defaults
aseembits93 Jun 4, 2026
5bfd635
Merge branch 'main' into opt-preprocess+opt-python-postproc+opt-pipel…
aseembits93 Jun 4, 2026
16e26e1
Fix RF-DETR preprocess buffer handoff
aseembits93 Jun 4, 2026
61060cf
Remove orphan RF-DETR postprocess block
aseembits93 Jun 4, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
118 changes: 118 additions & 0 deletions .github/workflows/docker.inference_server.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,118 @@
name: Build inference_server images

# Manual only. When you click "Run workflow", pick your branch as the ref —
# it builds that ref's code (never main on its own). Tick the targets you want;
# tick none = build all. (workflow_dispatch only appears in the UI once this
# file is on the default branch — a one-time merge; it still never auto-runs.)
permissions:
contents: read
on:
workflow_dispatch:
inputs:
gpu:
type: boolean
description: "Build GPU (cu124, amd64)"
default: false
cpu:
type: boolean
description: "Build CPU (amd64)"
default: false
jetson_5_1_1:
type: boolean
description: "Build Jetson JP5.1.1 (arm64)"
default: false
jetson_6_0_0:
type: boolean
description: "Build Jetson JP6.0 (arm64)"
default: false
jetson_6_2_0:
type: boolean
description: "Build Jetson JP6.2 (arm64)"
default: false
tag_suffix:
type: string
description: "Postfix appended to image tag, e.g. -test → :gpu-0.1.0-test"
default: ""
push:
type: boolean
description: "Push images after build (otherwise build-only)"
default: false

env:
IMAGE: roboflow/inference-server-experimental

jobs:
setup:
runs-on: ubuntu-latest
outputs:
matrix: ${{ steps.set.outputs.matrix }}
version: ${{ steps.ver.outputs.version }}
steps:
- name: Checkout
uses: actions/checkout@v6
- name: Read inference_server version
id: ver
run: |
v=$(python -c "import tomllib; print(tomllib.load(open('inference_server/pyproject.toml','rb'))['project']['version'])")
echo "version=$v" >> "$GITHUB_OUTPUT"
echo "inference_server version: $v"
- name: Compute build matrix (none selected = all)
id: set
env:
GPU: ${{ inputs.gpu }}
CPU: ${{ inputs.cpu }}
J511: ${{ inputs.jetson_5_1_1 }}
J600: ${{ inputs.jetson_6_0_0 }}
J620: ${{ inputs.jetson_6_2_0 }}
run: |
set -euo pipefail
all_json=$(cat <<EOF
[
{"name":"gpu","dockerfile":"inference_server/docker/Dockerfile.gpu","platform":"linux/amd64","depot_project":"grl7ffzxd7","sel":"$GPU"},
{"name":"cpu","dockerfile":"inference_server/docker/Dockerfile.cpu","platform":"linux/amd64","depot_project":"grl7ffzxd7","sel":"$CPU"},
{"name":"jetson-5.1.1","dockerfile":"inference_server/docker/Dockerfile.jetson.5.1.1","platform":"linux/arm64","depot_project":"2rp7mfjw7q","sel":"$J511"},
{"name":"jetson-6.0.0","dockerfile":"inference_server/docker/Dockerfile.jetson.6.0.0","platform":"linux/arm64","depot_project":"2rp7mfjw7q","sel":"$J600"},
{"name":"jetson-6.2.0","dockerfile":"inference_server/docker/Dockerfile.jetson.6.2.0","platform":"linux/arm64","depot_project":"2rp7mfjw7q","sel":"$J620"}
]
EOF
)
selected=$(echo "$all_json" | jq -c '[.[] | select(.sel=="true")]')
if [ "$(echo "$selected" | jq 'length')" -eq 0 ]; then
selected="$all_json"
fi
matrix=$(echo "$selected" | jq -c '{include: [.[] | {name, dockerfile, platform, depot_project}]}')
echo "matrix=$matrix" >> "$GITHUB_OUTPUT"
echo "$matrix" | jq .

build:
needs: setup
runs-on: ubuntu-latest
timeout-minutes: 180
permissions:
id-token: write
contents: read
strategy:
fail-fast: false
matrix: ${{ fromJson(needs.setup.outputs.matrix) }}
name: build-${{ matrix.name }}
steps:
- name: Checkout
uses: actions/checkout@v6
- name: Set up Depot CLI
uses: depot/setup-action@v1
# jp5.1.1 base (roboflow/l4t-ml) is private on Docker Hub.
- name: Login to Docker Hub
uses: docker/login-action@v3
with:
username: ${{ secrets.DOCKERHUB_USERNAME }}
password: ${{ secrets.DOCKERHUB_TOKEN }}
- name: Build (and optionally push)
uses: depot/build-push-action@v1
with:
project: ${{ matrix.depot_project }}
context: .
file: ${{ matrix.dockerfile }}
platforms: ${{ matrix.platform }}
pull: true
push: ${{ inputs.push }}
tags: ${{ env.IMAGE }}:${{ matrix.name }}-${{ needs.setup.outputs.version }}${{ inputs.tag_suffix }}
Loading
Loading