Bug: facing challenges with recently changed prometheus metrics naming convention #5601

tayalrishabh96 · 2024-07-31T11:04:22Z

📜 Description

We are not receiving any alerts because the Prometheus metric naming convention changed recently.
Earlier: nats_event_consumption_time_count
Now: Nats_Event_Consumption_Time_count
The rename applies to all streams except image-scanner, so in addition to the missing alerts we would need to create separate alerts for the image-scanner NATS metrics.
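As a possible stopgap while both spellings exist, an alert expression could match either metric name through the `__name__` label. The sketch below only builds the selector string. The helper name `either_name_selector` is hypothetical, and it assumes the names contain no regex metacharacters, which holds for valid Prometheus metric names.

```python
def either_name_selector(*names: str) -> str:
    """Build a PromQL selector that matches any of the given metric names.

    PromQL regex matchers are fully anchored, so a plain alternation
    matches exactly these names and nothing else.
    """
    return '{__name__=~"' + "|".join(names) + '"}'


# The two spellings reported in this issue.
selector = either_name_selector(
    "nats_event_consumption_time_count",
    "Nats_Event_Consumption_Time_count",
)
print(selector)
```

This is a workaround sketch, not a fix. The issue itself asks for a single consistent name.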

Affected areas

Other CRITICAL functionality

Additional affected areas

Other CRITICAL functionality

Prod/Non-prod environments?

Prod

Is User unblocked?

No

How was the user un-blocked?

None

Impact on Enterprise

Unable to track anomalies in NATS, such as the NATS event consumption time.

👟 Steps to replicate the Issue

If you plot nats_event_consumption_time_count in Grafana, you will see that data stops arriving after a certain point in time.
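The same check can be made without Grafana by asking the Prometheus HTTP API for the old metric name over a time range. The sketch below only builds the `/api/v1/query_range` URL. The base URL, timestamps and step are placeholder assumptions, not values from this report.

```python
from urllib.parse import urlencode


def query_range_url(base_url: str, metric: str, start: int, end: int, step: str) -> str:
    """Build a Prometheus /api/v1/query_range URL for a single metric name."""
    params = urlencode({"query": metric, "start": start, "end": end, "step": step})
    return f"{base_url.rstrip('/')}/api/v1/query_range?{params}"


# Placeholder host and Unix timestamps; substitute real values.
url = query_range_url(
    "http://prometheus.example:9090",
    "nats_event_consumption_time_count",
    start=1722384000,
    end=1722427200,
    step="60s",
)
print(url)
```

Fetching this URL and looking at the last returned sample would show when the old metric name stopped reporting.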

👍 Expected behavior

The metrics naming convention should be the same across all exposed metrics.
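A consistency check along these lines could catch such a rename before release. This is a minimal sketch that assumes the intended convention is lowercase snake_case. Prometheus itself also accepts uppercase letters, so the pattern encodes the project convention rather than a Prometheus rule, and the function names are hypothetical.

```python
import re

# Assumed convention: lowercase letters, digits, underscores and colons only.
LOWER_SNAKE = re.compile(r"[a-z_:][a-z0-9_:]*")


def is_lower_snake(name: str) -> bool:
    """Return True if the metric name follows the assumed lowercase convention."""
    return LOWER_SNAKE.fullmatch(name) is not None


def find_nonconforming(names):
    """Return the metric names that break the assumed convention."""
    return [n for n in names if not is_lower_snake(n)]


names = [
    "nats_event_consumption_time_count",
    "Nats_Event_Consumption_Time_count",
]
print(find_nonconforming(names))  # ['Nats_Event_Consumption_Time_count']
```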

👎 Actual Behavior

Currently all metric names are lowercase except the NATS ones, which use capitalized words (for example Nats_Event_Consumption_Time_count).

☸ Kubernetes version

EKS 1.30

Cloud provider

AWS

🌍 Browser

Chrome

✅ Proposed Solution

NA

👀 Have you spent some time to check if this issue has been raised before?

I checked and didn't find any similar issue

🏢 Have you read the Code of Conduct?

I have read the Code of Conduct

github-actions · 2024-07-31T11:04:38Z

Final Score: 240

tayalrishabh96 added the bug and pager-duty labels Jul 31, 2024

tayalrishabh96 assigned prakarsh-dt, vikramdevtron, vivek-devtron and kripanshdevtron Jul 31, 2024

github-actions bot removed the pager-duty label Jul 31, 2024

tayalrishabh96 added the pager-duty label Jul 31, 2024

prkhrkat mentioned this issue Jul 31, 2024

feat: Event names fix as per prometheus devtron-labs/common-lib#96

Merged

prakarsh-dt added the SRE Reported label Aug 2, 2024

prakarsh-dt closed this as completed Aug 2, 2024

prkhrkat reopened this Aug 2, 2024

Ash-exp mentioned this issue Aug 9, 2024

fix: dependabot security updates #5608

Merged

7 tasks

Ash-exp closed this as completed in #5608 Aug 9, 2024
