Skip to content

Conversation

@TimDiekmann
Copy link
Member

🌟 What is the purpose of this PR?

Add a CloudWatch alarm for high CPU utilization on RDS PostgreSQL instances to improve monitoring and alerting capabilities.

🔍 What does this change?

  • Adds a new CloudWatch metric alarm that triggers when RDS CPU utilization exceeds 80% for 10 minutes
  • Configures the alarm to send notifications to the existing database alerts SNS topic
  • Sets appropriate tags for severity (WARNING) and purpose documentation

🛡 What tests cover this?

  • Terraform plan validation will verify the resource configuration

❓ How to test this?

  1. Apply the Terraform changes
  2. Verify the new alarm appears in CloudWatch
  3. Optionally, simulate high CPU load to confirm the alert triggers correctly

Pre-Merge Checklist 🚀

🚢 Has this modified a publishable library?

This PR:

  • does not modify any publishable blocks or libraries, or modifications do not need publishing

📜 Does this require a change to the docs?

The changes in this PR:

  • are internal and do not require a docs change

🕸️ Does this require a change to the Turbo Graph?

The changes in this PR:

  • do not affect the execution graph

@github-actions github-actions bot added area/infra Relates to version control, CI, CD or IaC (area) area/infra > terraform labels Nov 3, 2025
@graphite-app
Copy link
Contributor

graphite-app bot commented Nov 3, 2025

Graphite Automations

"Request DevOps reviewers once CI passes" took an action on this PR • (11/03/25)

1 reviewer was added to this PR based on Tim Diekmann's automation.

@TimDiekmann TimDiekmann force-pushed the t/sre-80-vanta-remediate-sql-database-cpu-monitored branch from 90c4cee to cfb61a8 Compare November 3, 2025 19:03
@TimDiekmann TimDiekmann force-pushed the t/sre-79-vanta-remediate-sql-database-free-storage-space-monitored branch from 67decc1 to 91e3691 Compare November 3, 2025 19:03
@vilkinsons vilkinsons changed the title SRE-80: Add CPUUtilization Cloudwatch alert for RDS SRE-80: Add CPUUtilization CloudWatch alert for RDS Nov 3, 2025
Base automatically changed from t/sre-79-vanta-remediate-sql-database-free-storage-space-monitored to main November 4, 2025 10:42
@TimDiekmann TimDiekmann force-pushed the t/sre-80-vanta-remediate-sql-database-cpu-monitored branch from cfb61a8 to e4a9a36 Compare November 4, 2025 10:44
@graphite-app
Copy link
Contributor

graphite-app bot commented Nov 4, 2025

Merge activity

  • Nov 4, 10:45 AM UTC: Graphite rebased this pull request, because this pull request is set to merge when ready.

@TimDiekmann TimDiekmann added this pull request to the merge queue Nov 4, 2025
Merged via the queue into main with commit f5288c1 Nov 4, 2025
34 checks passed
@TimDiekmann TimDiekmann deleted the t/sre-80-vanta-remediate-sql-database-cpu-monitored branch November 4, 2025 11:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/infra > terraform area/infra Relates to version control, CI, CD or IaC (area)

Development

Successfully merging this pull request may close these issues.

3 participants