Skip to content

Conversation

@QuentinBisson
Copy link
Contributor

@QuentinBisson QuentinBisson commented Nov 27, 2025

Before adding a new alerting rule into this repository you should consider creating an SLO rules instead.
SLO helps you both increase the quality of your monitoring and reduce the alert noise.


This PR updates and fixes the readme a bit + adds the new ticket alerts information.
It also cleans up some minor things

Checklist

@QuentinBisson QuentinBisson self-assigned this Nov 27, 2025
@QuentinBisson QuentinBisson requested a review from a team as a code owner November 27, 2025 13:24
@QuentinBisson QuentinBisson requested a review from a team as a code owner November 27, 2025 13:39
@github-project-automation github-project-automation bot moved this to Inbox 📥 in Roadmap Nov 27, 2025
@QuentinBisson QuentinBisson moved this from Inbox 📥 to Blocked / Waiting ⛔️ in Roadmap Nov 27, 2025
for: 15m
labels:
severity: none
severity: notify
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure it's related to the "update readme" PR title 😅
More seriously, the goal of an alert with severity none is that we can spot easily that something is fishy when looking at alertmanager or alerts timeline dashboard, without having it unnecessarily send a page or a Slack notif.

Is there a specific need for this change?
If yes, please don't do it hidden in an unrelated PR.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well, no, I got a bit too eager with my change. We use this severity: none but it was not defined in the readme nor in pint.
Then I figured it would be best that only inhibitions and heartbeats have the severity none as having it set to notify for this alert would not really change anything

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would make it send alerts to Slack, and make the alerts channels unreadable (yes, some of us read them 😁 )

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, but then, what do we do with that? Should we make it a ticket alert?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, but then, what do we do with that? Should we make it a ticket alert?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This specific alert?
Maybe just discard it.

tempo/update.sh Outdated
Comment on lines 64 to 67
# This one fires for 2 days when a new cluster is created,
# so we don't want it to page until we run tempo all around.
rule="$(echo "$rule" \
| sed 's/"severity": "page"/"severity": "none"/' \
| sed 's/"severity": "page"/"severity": "notify"/' \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

When we say we don't want it to page, I guess we don't want it to spam Slack either.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True, please check my other comment for the reasonning :)

@hervenicol
Copy link
Contributor

Looks like there's 2 PRs in one:

  • one that updates the readme (fits with the PR title)
  • one that changes the CI to add extra checks / rework pint config

@QuentinBisson
Copy link
Contributor Author

I think it's actually 3, with the tempo alert change... I'll split this in 3

@QuentinBisson
Copy link
Contributor Author

@hervenicol it should be better now

@QuentinBisson QuentinBisson enabled auto-merge (squash) December 1, 2025 13:20
@QuentinBisson QuentinBisson merged commit cf689f5 into main Dec 1, 2025
6 checks passed
@QuentinBisson QuentinBisson deleted the update-readme branch December 1, 2025 13:41
@github-project-automation github-project-automation bot moved this from Blocked / Waiting ⛔️ to Done ✅ in Roadmap Dec 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done ✅

Development

Successfully merging this pull request may close these issues.

3 participants