Volume failed to attach: No online replicas are available #1728
Errors like these in the io-engine logs on two nodes (46 and 47) explain the missing info in etcd. One thing that could have led to this is that the persist during nexus create didn't succeed, and the later persist during nexus shutdown did succeed.
There are a lot of other connectivity errors on these io-engines, e.g.
One of the nodes in this 3-node cluster was rebooted.
We ran into another issue where the NexusSpec in etcd is missing.
The related key:
What is the best way to detect this error and recover?
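One rough way to detect this condition is to list the keys in etcd and check whether each expected nexus has a spec entry. The sketch below is only an illustration under stated assumptions: it assumes the keys were already dumped to text (for example with `etcdctl get <prefix> --prefix --keys-only`), and that a nexus spec lives under a key whose last two path segments are `NexusSpec/<uuid>`. The function name and the key layout are assumptions, not something confirmed in this thread, so adjust them to what your cluster actually stores.

```python
from typing import Iterable, List


def find_missing_nexus_specs(etcd_keys: Iterable[str],
                             nexus_uuids: Iterable[str]) -> List[str]:
    """Return the nexus UUIDs that have no matching NexusSpec key.

    Assumption (hypothetical layout): a spec is stored under a key whose
    last two path segments are "NexusSpec/<uuid>".
    """
    present = set()
    for key in etcd_keys:
        # Keys-only dumps often contain blank lines; these are skipped below.
        parts = key.strip().rstrip("/").split("/")
        if len(parts) >= 2 and parts[-2] == "NexusSpec":
            present.add(parts[-1])
    # Preserve the caller's ordering of the UUIDs being checked.
    return [uuid for uuid in nexus_uuids if uuid not in present]
```

This only covers detection. It reports which nexus UUIDs lack a spec key and does not attempt any recovery.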
hmm it does seem present:
Several connection issues have since been fixed, so I suggest you upgrade to 2.7.0. I'm also seeing a bug which is fixed and will be released in v2.7.1 soon, but 2.7.0 will already be better.
@tiagolobocastro
The following is the command run while in the error state:
Agent-core has an error about the missing Nexus:
You can try to scale down the app trying to use the volume.
Tried deleting the pod. If the pod came back on the same node, the nexus didn't "reset". The "reset" only happens if the pod is scheduled to a different node. Regarding the upgrade to 2.7.0, the mayastor plugin not recovering after 1 node goes down is really a blocker for us to upgrade (I filed an issue for it).
So the nexus did not get deleted?
Is it this issue? #1715
When you run with replicas>1, the mayastor-plugin requests can hit any of the api-rest pods. At this point all api-rest requests succeed again.
Mayastor 2.5.1:
Pod is failing to run with:
etcd shows:
Attaching the support bundle
mayastor-no-online.tar.gz