-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing telemetry core.unsealed metrics on standby nodes #10015
Comments
Facing the same issue on 1.6.x versions as well. Is this something that will be fixed? |
Hello! Any update on this? This is blocking us from abandoning vault_exporter. |
This issue is seen on Vault_version: 1.7.2, i m using statsite telemetry provider. |
Wanting to chime in that we're still working on a resolution for this. Thanks for your patience! |
@hsimon-hashicorp Hi, have any updates on this? It's important to emit metrics on the standby node in HA mode. |
We just upgraded to Vault 1.11.3. We saw all Vault replicas export I'm going to guess they just hadn't finished determining they were a cluster yet, and as soon as they went into HA standby mode, the standbys started hitting this bug and not reporting the metric. |
Just some more 2 cents "vault.core.unsealed" is missing, but the very basic "vault.core.active" is also missing .... |
I see the same behaviour with Vault 1.12.1 and missing vault_core_active metric after some time. We have a 3 node Vault setup with Raft storage deployed in K8s. |
@hsimon-hashicorp any updates about this issue? the lack of reliable core metrics makes it very difficult to properly monitor vault using prometheus. |
also fased this problem, also needs resolution |
+1 |
Hello, Any update? This really makes the unsealed metric useless. Thanks. |
Any update on this? Its been more than 3 years... The issue is still open |
Do I understand right? There is no way of knowing with prometheus if a VM on a HA cluster is sealed as long as some are unsealed. Does anyone find a solution to this? I really do not want to wait until (the whole cluster) vault is sealed before I get an alert. It defeats the purpose of HA setup where you can fix issues as they happen while keeping Vault unsealed. I just check and this seems the case for Enterprise Vault too |
WORKAROUND: makes sense? |
for warning alert (part of vault instance sealed)
for critical alert I think this is a problem needs to be solved, but currently I can only use this workaround |
Is this still an issue? I'm seeing "The /v1/sys/metrics endpoint is only accessible on active nodes and automatically disabled on standby nodes. You can enable the /v1/sys/metrics endpoint on standby nodes by enabling unauthenticated metrics access." This is on an HA setup in K8s with the Vault Helm chart v0.25.0 and Vault v1.14.0. When all are sealed:
When all are unsealed:
Make sure to set the I ran into another issue specific to the Vault Helm chart that caused metrics to disappear when all Vault pods are sealed, which we had to work around: hashicorp/vault-helm#990 And I'm running into another problem that I'm about to file an issue for where specifically the |
Hi folks. Just checking through older bugs. As @cascadia-sati mentioned, it would seem like this is not an issue any more, does anyone on this thread still see this problem? As far as I can see this was actually fixed by #12166 a couple of years ago - I've looked through the code and can confirm that now Closing for now, please let us know if someone is still seeing this on a version of Vault after 1.13.0. |
Describe the bug
This is a follow-up on slightly different #9771
To Reproduce
Steps to reproduce the behavior:
vault.core.unsealed
metric is missingExpected behavior
A clear and concise description of what you expected to happen.
vault.core.unsealed
metric should be present like on active (Leader) node:The lack of
core.unsealed
metrics for a HA standing-by node is problematic since it prevents to monitor the health of all HA nodes (and make sure HA is actually still available).Environment:
vault status
): 1.5.0vault version
): 1.5.0Vault server configuration file(s):
N/A
Additional context
Quickly going through
core.go
,core_metrics.go
andha.go
, it seems to me thatemitMetrics
(which spawns themetrics Loop
to refresh thecore.unsealed
metric) is only called viapostUnseal
, which is not called for HA standing-by HA node (incore.go
); only the Leader/Active node actually callspostUnseal
(inha.go
).The text was updated successfully, but these errors were encountered: