c-s fails to prepare test table during disrupt_truncate nemesis, but the test continues and starts the disruption
#8722
Comments
We should have a quick fix to catch it, raise an error, and exit the nemesis.
The 2nd point was addressed in:
Reproduced there:
Packages
Scylla version:
Kernel Version:
Installation details
Cluster size: 6 nodes (i4i.4xlarge)
Scylla Nodes used in this run:
OS / Image:
Test:
Logs and commands
Logs:
This nemesis is not supported by znodes; a fix is on the way: #9342
All stress commands run from within nemesis code should have their results checked, so that the nemesis can fail early.
Abort the nemesis flow early when a stress command fails, as continuing it is often invalid due to the cluster being in an unexpected state. For example, if the _prepare_test_table routine fails, subsequent steps that depend on the test table (or attempt disruptions on top of it) will also fail. This change adds a check for results of stress command triggered within the nemesis code, ensuring the nemesis halts early if the stress command is unsuccessful. Fixes: scylladb#8722
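The idea in the commit message above can be sketched roughly as follows. This is only an illustration, not the actual change: `StressResult`, `NemesisStressFailure`, `verify_stress_result`, and `disrupt_truncate_sketch` are hypothetical names, and the real framework's result objects and nemesis signatures may look different.

```python
from dataclasses import dataclass
from typing import Callable


class NemesisStressFailure(Exception):
    """Hypothetical error: a stress command started by a nemesis failed."""


@dataclass
class StressResult:
    # Hypothetical result holder; the real result type may differ.
    command: str
    exit_status: int
    stderr: str = ""


def verify_stress_result(result: StressResult) -> None:
    """Raise if the stress command did not finish successfully."""
    if result.exit_status != 0:
        raise NemesisStressFailure(
            f"stress command failed (exit {result.exit_status}): "
            f"{result.command!r}: {result.stderr}"
        )


def disrupt_truncate_sketch(
    run_stress: Callable[[str], StressResult],
    truncate_table: Callable[[], None],
) -> None:
    """Prepare the test table, then truncate it only if preparation succeeded."""
    # Placeholder command string; the real c-s command line is not shown here.
    result = run_stress("prepare-test-table")
    # Abort before the disruption if the table was never prepared.
    verify_stress_result(result)
    truncate_table()
```

With a check like this, a failed table preparation surfaces as the nemesis error itself, instead of a later and less obvious truncate failure.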
At the beginning of the disrupt_truncate nemesis, the test ks/table are prepared with the c-s command:
The command fails with the error:
Even though the c-s command failed, the nemesis continues and starts the truncate disruption, which fails with:
Installation details
Cluster size: 4 nodes (Standard_L16s_v3)
Scylla Nodes used in this run:
OS / Image:
/subscriptions/6c268694-47ab-43ab-b306-3c5514bc4112/resourceGroups/SCYLLA-IMAGES/providers/Microsoft.Compute/images/scylla-6.2.0-dev-x86_64-2024-09-13T02-56-40
(azure: undefined_region)
Test:
longevity-1tb-5days-azure-test
Test id:
ce64f53c-084b-4445-8b62-784fa80adf1c
Test name:
scylla-master/tier1/longevity-1tb-5days-azure-test
Test method:
longevity_test.LongevityTest.test_custom_time
Test config file(s):
Logs and commands
$ hydra investigate show-monitor ce64f53c-084b-4445-8b62-784fa80adf1c
$ hydra investigate show-logs ce64f53c-084b-4445-8b62-784fa80adf1c
Logs:
Jenkins job URL
Argus