Bug: facing challenges with recently changed prometheus metrics naming convention #5601
Closed
2 tasks done
Labels
bug
Something isn't working
pager-duty
Bugs / Issues found while on pager duty
SRE Reported
SRE Reported issues
📜 Description
Not receiving any alerts due to recent change in prometheus metrics naming convention.
earlier : nats_event_consumption_time_count
now : Nats_Event_Consumption_Time_count
Due to this change we are not receiving any alerts and also this is changed only for all streams except image-scanner due to which we will need to create segregated alerts for image-scanner NATS metrics.
Affected areas
Other CRITICAL functionality
Additional affected areas
Other CRITICAL functionality
Prod/Non-prod environments?
Prod
Is User unblocked?
No
How was the user un-blocked?
None
Impact on Enterprise
Unable to track anomalies in NATS like nats event consumption time etc.
👟 Steps to replicate the Issue
if you plot a graph in grafana over nats_event_consumption_time_count then you will see the data stopped coming after a certain time duration.
👍 Expected behavior
metrics naming convention should be same across all exposed metrics.
👎 Actual Behavior
Currently all metrics naming convention is lower case except for NATs which is camel case.
☸ Kubernetes version
EKS 1.30
Cloud provider
🌍 Browser
Chrome
✅ Proposed Solution
NA
👀 Have you spent some time to check if this issue has been raised before?
🏢 Have you read the Code of Conduct?
The text was updated successfully, but these errors were encountered: