Stop CTST gracefully at a cucumber-level time budget by delthas · Pull Request #2457 · scality/Zenko

delthas · 2026-06-30T13:14:36Z

What

Add an in-process time budget so CTST stops gracefully before the GitHub step timeout would hard-kill it. Touches only tests/functional/ctst/common/hooks.ts and .github/scripts/end2end/run-e2e-ctst.sh.

Why

The only stop for a hung ctst-end2end-sharded run is the step timeout-minutes, which hard-kills the process. cucumber-js v13 installs no signal handler and has no whole-run timeout, so on a kill it never reaches testRunFinished — report.xml is written empty (Error parsing report.xml: no element found), teardown is skipped, and the archive step has nothing to publish.

How

run-e2e-ctst.sh exports CTST_DEADLINE_EPOCH_MS (default 180 min, via CTST_MAX_RUNTIME_MIN) and a marker path, and clears any stale marker.
An early Before hook (registered before the expensive @Quotas/count-items hooks) skips any scenario that starts past the deadline — returning 'skipped' short-circuits the remaining hooks/steps — and drops the marker file.
cucumber then finishes normally: it writes report.xml/report.html/report.ndjson and runs teardown. The runner script exits 1 if the marker is present, so a timed-out run is still red while keeping a clean report.

When CTST_DEADLINE_EPOCH_MS is unset (local runs), the hook is a no-op.

Relationship to ZENKO-5306

Companion to the step-level timeout. ZENKO-5306 lowers the step timeout-minutes to 190 as a hard backstop: 180 min (graceful, this PR) < 190 min (hard backstop) < the old 360-min job cap.

tsc --build, eslint, and bash -n pass.

Issue: ZENKO-5309

The GitHub step timeout hard-kills the process, so Cucumber never writes its reports (empty report.xml). Add an in-process deadline instead: the runner script exports a deadline (default 180m) and a marker path; an early Before hook skips any scenario starting past the deadline (so the expensive @Quotas/count-items hooks are short-circuited) and drops the marker. Cucumber then finishes normally, writing report.xml/html/ndjson and running teardown; the script fails the run if the marker is present. The 190m step timeout (ZENKO-5306) remains as a hard backstop. Issue: ZENKO-5309

bert-e · 2026-06-30T13:14:41Z

Hello delthas,

My role is to assist you with the merge of this
pull request. Please type @bert-e help to get information
on this process, or consult the user documentation.

Available options

name	description	privileged	authored
`/after_pull_request`	Wait for the given pull request id to be merged before continuing with the current one.
`/bypass_author_approval`	Bypass the pull request author's approval	⭐
`/bypass_build_status`	Bypass the build and test status	⭐
`/bypass_commit_size`	Bypass the check on the size of the changeset `TBA`	⭐
`/bypass_incompatible_branch`	Bypass the check on the source branch prefix	⭐
`/bypass_jira_check`	Bypass the Jira issue check	⭐
`/bypass_peer_approval`	Bypass the pull request peers' approval	⭐
`/bypass_leader_approval`	Bypass the pull request leaders' approval	⭐
`/approve`	Instruct Bert-E that the author has approved the pull request.		✍️
`/create_pull_requests`	Allow the creation of integration pull requests.
`/create_integration_branches`	Allow the creation of integration branches.
`/no_octopus`	Prevent Wall-E from doing any octopus merge and use multiple consecutive merge instead
`/unanimity`	Change review acceptance criteria from `one reviewer at least` to `all reviewers`
`/wait`	Instruct Bert-E not to run until further notice.

Available commands

name	description	privileged
`/help`	Print Bert-E's manual in the pull request.
`/status`	Print Bert-E's current status in the pull request `TBA`
`/clear`	Remove all comments from Bert-E from the history `TBA`
`/retry`	Re-start a fresh build `TBA`
`/build`	Re-start a fresh build `TBA`
`/force_reset`	Delete integration branches & pull requests, and restart merge process from the beginning.
`/reset`	Try to remove integration branches unless there are commits on them which do not appear on the source branch.

Status report is not available.

bert-e · 2026-06-30T13:16:07Z

Waiting for approval

The following approvals are needed before I can proceed with the merge:

the author
2 peers

francoisferrand · 2026-06-30T21:19:14Z

+# Before hook), so Cucumber finishes normally and writes its reports. The hook
+# drops a marker file; we fail the run if it is present.
+CTST_MAX_RUNTIME_MIN=${CTST_MAX_RUNTIME_MIN:-180}
+export CTST_DEADLINE_EPOCH_MS=$(( $(date +%s%3N) + CTST_MAX_RUNTIME_MIN * 60 * 1000 ))


best to minimize content of wrapper script : should do this computation in javascript

Suggested change

export CTST_DEADLINE_EPOCH_MS=$(( $(date +%s%3N) + CTST_MAX_RUNTIME_MIN * 60 * 1000 ))

export CTST_MAX_RUNTIME_MIN=${CTST_MAX_RUNTIME_MIN:-180}

even CTST_MAX_RUNTIME_MIN default value could be set directly in the javascript code?

francoisferrand · 2026-06-30T21:22:13Z

+    --format message:ctst/reports/report.ndjson || rc=$?
+
+if [ -f "$CTST_TIMEOUT_MARKER" ]; then
+    echo "::error::CTST exceeded its ${CTST_MAX_RUNTIME_MIN}-minute time budget; remaining scenarios were skipped."


do we need to display a log here ?
we could display the same from within CTST (first time -or everytime- we hit the deadline...), and avoid the whole "marker" handling: which seems overly complex

no need imo, I would display a log in the hook, for each test skipped, we can access the name of the scenario and just write "scenario xxx skipped because of global timeout"

francoisferrand · 2026-06-30T21:24:45Z

+                fs.writeFileSync(marker, 'timeout');
+            } catch { /* best-effort: marker is advisory */ }
+        }
+        return 'skipped';


what does this "skipped" do?
these (skipped) tests need to be marked as somewhat failed, to ensure we don't pass CI unexpectedly: maybe "skipped" does that, but it must a skip like "I could not execute it" (=potentially failed) rather than "test was intentionally disabled/skipped" (=ignore)

I think it's an official cucumber keyword, but yeah this should probably use fail instead 🤔

francoisferrand · 2026-06-30T21:28:59Z

+if [ -f "$CTST_TIMEOUT_MARKER" ]; then
+    echo "::error::CTST exceeded its ${CTST_MAX_RUNTIME_MIN}-minute time budget; remaining scenarios were skipped."
+    if [ "$rc" -eq 0 ]; then
+        rc=1


instead of wrapper script, there is already some code to "tweak" the return code (and make it pass always), c.f. tests/functional/ctst/cucumber.config.cjs :

if (process.env.CI_PASS_ON_TEST_FAILURE === 'true') { const _exit = process.exit; process.exit = function exit(code) { _exit(code === 1 ? 0 : code); }; process.on('beforeExit', () => { if (process.exitCode === 1) { process.exitCode = 0; } }); }

→ instead of this, you could thus tweak the result to actually return something other than 0 if we want to skip further analysis (e.g. test results merging...)
→ however, why do we need to return error 1 in this case? Timeout is likely an issue in the tests, shouldn't we handle this like all other failed tests (i.e. ignore the result and let followup steps check the report for failure)

bert-e · 2026-06-30T21:32:20Z

Request integration branches

Waiting for integration branch creation to be requested by the user.

To request integration branches, please comment on this pull request with the following command:

/create_integration_branches

Alternatively, the /approve and /create_pull_requests commands will automatically
create the integration branches.

SylvainSenechal · 2026-07-01T08:45:58Z

+// Time budget: scenarios that start after the deadline are skipped so Cucumber
+// finishes normally (writing its reports) before the GitHub step timeout would
+// hard-kill the process. The marker file lets the runner script fail the run.
+Before(() => {


I would simplify this :

hardcode here max duration allowed for all ctst tests

send as env variable a timestamp of when we started running ctst (+ verify if maybe this variable is already exposed by cucumber)

just verify time.now < timestampStart + maxDurationAllowed

scality deleted a comment from bert-e Jun 30, 2026

delthas mentioned this pull request Jun 30, 2026

Cap ctst-end2end-sharded test step at 190 min #2454

Merged

delthas requested review from a team, SylvainSenechal and maeldonn June 30, 2026 17:19

francoisferrand requested changes Jun 30, 2026

View reviewed changes

scality deleted a comment from bert-e Jun 30, 2026

SylvainSenechal reviewed Jul 1, 2026

View reviewed changes

SylvainSenechal requested changes Jul 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Stop CTST gracefully at a cucumber-level time budget#2457

Stop CTST gracefully at a cucumber-level time budget#2457
delthas wants to merge 1 commit into
development/2.15from
improvement/ZENKO-5309/ctst-time-budget

delthas commented Jun 30, 2026

Uh oh!

bert-e commented Jun 30, 2026

Uh oh!

bert-e commented Jun 30, 2026

Uh oh!

francoisferrand Jun 30, 2026

Uh oh!

francoisferrand Jun 30, 2026

Uh oh!

francoisferrand Jun 30, 2026

Uh oh!

SylvainSenechal Jul 1, 2026

Uh oh!

francoisferrand Jun 30, 2026

Uh oh!

SylvainSenechal Jul 1, 2026

Uh oh!

francoisferrand Jun 30, 2026

Uh oh!

bert-e commented Jun 30, 2026

Uh oh!

SylvainSenechal Jul 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	export CTST_DEADLINE_EPOCH_MS=$(( $(date +%s%3N) + CTST_MAX_RUNTIME_MIN * 60 * 1000 ))
	export CTST_MAX_RUNTIME_MIN=${CTST_MAX_RUNTIME_MIN:-180}

Uh oh!

Conversation

delthas commented Jun 30, 2026

What

Why

How

Relationship to ZENKO-5306

Uh oh!

bert-e commented Jun 30, 2026

Hello delthas,

Uh oh!

bert-e commented Jun 30, 2026

Waiting for approval

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bert-e commented Jun 30, 2026

Request integration branches

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants