You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
I'm using the Vault Helm chart to deploy Vault to our K8s cluster using a three-replica HA setup. While working on monitoring for the seal status, I noticed that the vault_core_unsealed metric mysteriously disappears after some time. This discussion hinted that it may be due to the prometheus_retention_time config field. That does indeed seem to be the case.
Increasing it does make the metric stick around longer, but only until the retention time elapses. When the seal status does change, the metric comes back, and alerting can kick in, but in the meantime our dashboards and alerts show "no data", which is not ideal.
To Reproduce
Simply install Vault and watch the vault_core_unsealed metric disappear after prometheus_retention_time amount of time.
Expected behavior
The vault_core_unsealed metric should persist, even if it doesn't change. I'm assuming Vault needs to be modified to publish the metric periodically instead of only when it's changed.
Environment:
Vault Server Version (retrieve with vault status): v1.14.0
Vault CLI Version (retrieve with vault version): v1.14.0
Server Operating System/Architecture: Vault running on Kubernetes EKS installed via v0.25.0 of the Vault Helm chart
Vault server configuration file(s):
(My Helm values file with the Vault config embedded)
Along with hashicorp/vault-helm#990, this has made monitoring the seal status very difficult in our HA Vault setup on K8s. I'm still somewhat new to DevOps, so I was surprised and somewhat disappointed to find this not well baked. Surely we're not the only ones who find it important to monitor the seal status. If there's some better way to do this, or I'm missing something, please point me in the right direction.
This seems to only happen when the metric goes to "0" for "sealed. It then disappears after the retention time. However, when the metric is "1" for "unsealed", it persists even after the retention time.
The image below is an example with a retention time of five minutes. See the metrics disappear after that time when Vault seals. When it's unsealed, the metric persists.
Describe the bug
I'm using the Vault Helm chart to deploy Vault to our K8s cluster using a three-replica HA setup. While working on monitoring for the seal status, I noticed that the
vault_core_unsealed
metric mysteriously disappears after some time. This discussion hinted that it may be due to theprometheus_retention_time
config field. That does indeed seem to be the case.Increasing it does make the metric stick around longer, but only until the retention time elapses. When the seal status does change, the metric comes back, and alerting can kick in, but in the meantime our dashboards and alerts show "no data", which is not ideal.
To Reproduce
Simply install Vault and watch the
vault_core_unsealed
metric disappear afterprometheus_retention_time
amount of time.Expected behavior
The
vault_core_unsealed
metric should persist, even if it doesn't change. I'm assuming Vault needs to be modified to publish the metric periodically instead of only when it's changed.Environment:
vault status
): v1.14.0vault version
): v1.14.0Vault server configuration file(s):
(My Helm values file with the Vault config embedded)
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: