-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[K8S] nodetool_enospc
nemesis (IOChaos experiment
) causes target Scylla service to exit with code 1
and be restarted
#6327
Comments
@soyacz have you seen this problem? |
nodetool_enospc
nemesis (IOChaos experiment
) causes target Scylla service to exit with code 1
and be restartednodetool_enospc
nemesis (IOChaos experiment
) causes target Scylla service to exit with code 1
and be restarted
Sounds like the expected flow, if in case of ENSPC Scylla dies, and we should wait for it to be back up. |
Nevertheless, scylla restarts, finds enospc errors and later must restart again. The test stops due end of time or some other error? |
The tearDown starts right after the error described here caused by the end of the stress:
|
@vponomaryov maybe this error happens due new way of provisioning disks? Maybe now we can fill up node directory up to the full instead of playing with os api? |
It may be related to the new way of the storage setup, and if so, then, probably, it is related to the XFS configuration. |
Retesting other fixes for other bugs this one was hit all the time. |
The 'nodetool_enospc' nemesis fails all the time on K8S [1]. So, skip it until the the bug gets fixed. [1] scylladb#6327
The 'nodetool_enospc' nemesis fails all the time on K8S [1]. So, skip it until the the bug gets fixed. [1] scylladb#6327
The 'nodetool_enospc' nemesis fails all the time on K8S [1]. So, skip it until the the bug gets fixed. [1] #6327
I think we no longer can use Chaos Mesh for this - looks previously io_submit() was not used (IIUC scylla detects if it is feasible to use it) and the used filesystem mock (https://docs.rs/fuser/0.7.0/fuser/trait.Filesystem.html) does not support it (and doesn't look it will). We need to rewrite this nemesis to use different means of filling up disk. |
Let's file a bug on https://github.com/cberner/fuser, with the details, and why we need io_submit support. and meanwhile lets disable this nemesis. we'll then consider other option based on the response from community. |
Issues created: In the meanwhile @vponomaryov already disabled this nemesis. |
Prerequisites
Versions
Logs
Description
SCT logs have following:
And the logs of the target node:
We see that Scylla service exits with code
1
as a reaction to theIOChaos
experiment.The nemesis is marked as
passed
but the test run stops after this error.Steps to Reproduce
disrupt_enospc
nemesisExpected behavior: Scylla service must continue to run
Actual behavior: Scylla service immediately exits with code
1
.The text was updated successfully, but these errors were encountered: