feat(control-plane): [issue 4746] pool sufficiency metrics #4747

mabbott-aurorasolar · 2025-09-04T14:46:09Z

See #4746 for full context.

In short, this PR will add metrics (optionally) to record whether or not a given runner pool was sufficient to handle a job when it arrives. This will allow users (specifically, my company) to better understand whether we're right-sizing our pools or not.

Copilot

Pull Request Overview

This PR adds optional pool sufficiency metrics to track whether runner pools are adequately sized to handle incoming jobs. The feature allows users to monitor if their pool configurations are right-sized by recording metrics when jobs arrive.

Adds a new enable_pool_sufficiency configuration option across all relevant modules
Implements metric collection logic in the scale-up function to track pool adequacy
Includes comprehensive tests to verify the metric behavior under different scenarios
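As a rough illustration of how such a metric could be read, assume each job emits a sample of 1 when the pool was sufficient and 0 when it was not (matching the `1.0`/`0.0` values in the diff). The average over a period would then be the fraction of jobs the pool could absorb. The helper name `poolSufficiencyRate` and the sample values are hypothetical and not part of the PR.

```typescript
// Hypothetical helper, not part of the PR. Each sample is the value the
// metric would record for one job: 1 if the pool was sufficient, 0 if not.
function poolSufficiencyRate(samples: number[]): number | undefined {
  if (samples.length === 0) return undefined; // no jobs arrived, so no rate
  const sum = samples.reduce((acc, value) => acc + value, 0);
  return sum / samples.length;
}

// Four jobs arrived and the pool covered three of them.
const exampleRate = poolSufficiencyRate([1, 1, 0, 1]); // 0.75
```

A rate that stays well below 1 would suggest the pool is undersized for the observed load. A rate pinned at 1 would be consistent with a pool that is adequate or larger than needed.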

Reviewed Changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.


File	Description
variables.tf	Adds enable_pool_sufficiency option to root module metrics configuration
modules/runners/variables.tf	Adds enable_pool_sufficiency option to runners module metrics configuration
modules/runners/scale-up.tf	Passes pool sufficiency metric environment variable to scale-up Lambda
modules/runners/job-retry/variables.tf	Adds enable_pool_sufficiency option to job-retry module configuration
modules/runners/job-retry/main.tf	Passes pool sufficiency metric environment variable to job-retry Lambda
modules/multi-runner/variables.tf	Adds enable_pool_sufficiency option to multi-runner module metrics configuration
lambdas/functions/control-plane/src/scale-runners/scale-up.ts	Implements pool sufficiency metric collection logic
lambdas/functions/control-plane/src/scale-runners/scale-up.test.ts	Adds comprehensive tests for pool sufficiency metric functionality
examples/multi-runner/main.tf	Documents the new enable_pool_sufficiency option in example
examples/default/main.tf	Documents the new enable_pool_sufficiency option in example


Copilot · 2025-09-04T18:43:32Z

lambdas/functions/control-plane/src/scale-runners/scale-up.ts

+function createPoolSufficiencyMetric(environment: string, payload: ActionRequestMessage, wasSufficient: boolean) {
+  if (yn(process.env.ENABLE_METRIC_POOL_SUFFICIENCY, { default: false })) {
+    const metric = createSingleMetric('SufficientPoolHosts', MetricUnit.Count, wasSufficient ? 1.0 : 0.0, {
+      Environment: environment,
+    });
+    metric.addMetadata('Environment', environment);
+    metric.addMetadata('RepositoryName', payload.repositoryName);
+    metric.addMetadata('RepositoryOwner', payload.repositoryOwner);
+  }
+}


The Environment dimension is added twice - once in the metric creation and again as metadata. The dimension in the metric creation (line 487) should be sufficient for grouping metrics, making the duplicate metadata on line 489 redundant.

npalm · 2025-10-03

I don't think this review comment is correct. The first time, Environment is added as a dimension. addMetadata only adds the value to the log, not as a dimension, correct?
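To make the dimension versus metadata distinction concrete, here is a minimal sketch of a record in the CloudWatch Embedded Metric Format (EMF). It is hand-built and does not call the metrics library used in the PR. The function `buildEmfRecord`, the namespace, and all values are hypothetical. In EMF, only keys listed under `_aws.CloudWatchMetrics[].Dimensions` are treated as dimensions. Any other top-level key is just a searchable property of the log entry.

```typescript
// Hand-built EMF record; a sketch only, not the library's implementation.
interface EmfRecord {
  _aws: {
    Timestamp: number;
    CloudWatchMetrics: Array<{
      Namespace: string;
      Dimensions: string[][];
      Metrics: Array<{ Name: string; Unit: string }>;
    }>;
  };
  [key: string]: unknown;
}

function buildEmfRecord(
  namespace: string,
  metricName: string,
  value: number,
  dimensions: Record<string, string>,
  metadata: Record<string, string>,
  timestamp: number,
): EmfRecord {
  return {
    ...metadata, // plain log properties, never referenced in the directive
    ...dimensions, // dimension values also live at the top level
    [metricName]: value,
    _aws: {
      Timestamp: timestamp,
      CloudWatchMetrics: [
        {
          Namespace: namespace,
          // Only the names listed here become metric dimensions.
          Dimensions: [Object.keys(dimensions)],
          Metrics: [{ Name: metricName, Unit: 'Count' }],
        },
      ],
    },
  };
}

// Hypothetical values mirroring the shape of the call in the diff.
const record = buildEmfRecord(
  'ExampleNamespace',
  'SufficientPoolHosts',
  1,
  { Environment: 'example-env' },
  { Environment: 'example-env', RepositoryName: 'example-repo', RepositoryOwner: 'example-owner' },
  0,
);
```

In this sketch `RepositoryName` and `RepositoryOwner` appear in the log entry but not in `Dimensions`, so they would not split the metric into separate series. The repeated `Environment` key collapses into the single top-level property that the dimension already requires.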

lambdas/functions/control-plane/src/scale-runners/scale-up.test.ts

npalm · 2025-09-04T18:44:59Z

@mabbott-aurorasolar thanks for your PR. I don't have much time to dig in over the next day. Are you considering the PR ready? If not, you can mark it as a draft. I see no documentation; at least this part of the docs needs to be adjusted: https://github-aws-runners.github.io/terraform-aws-github-runner/configuration/#multiple-runner-module-in-your-aws-account

mabbott-aurorasolar · 2025-09-04T20:26:25Z

@npalm great callout on the docs. I'll update those as well, and follow up on Copilot's suggestions. I honestly threw this together in an hour this morning and wasn't sure what the usual process on this repo is, but yeah, I do think this is mostly feature complete.

npalm · 2025-09-23T20:17:29Z

Sorry for keeping you waiting. I will do my best to put the PR on the list for next week.

npalm · 2025-10-03T15:09:27Z

@mabbott-aurorasolar The PR looks good in general, though I have not tested it yet. Did you have time to look at the docs?

feat(control-plane): [issue 4746] pool sufficiency metrics

eaaf4f5

mabbott-aurorasolar requested review from a team as code owners September 4, 2025 14:46

npalm requested a review from Copilot September 4, 2025 18:42

Copilot AI reviewed Sep 4, 2025


Merge branch 'main' into feat/issue-4746-sufficiency-metrics

2a5ce78
