chore(eks): add support for TRN1, TRN1N, TRN2, P5, P5E, P5EN instance types #36032
+15
−2
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Issue # (if applicable)
Closes #35914.
Reason for this change
Users cannot create EKS managed node groups with newer GPU and accelerator instance types (Trainium TRN1/TRN1N/TRN2 and P5/P5E/P5EN series). The
isGpuInstanceType()validation function does not recognize these instance types, causing validation errors that block deployment.Description of changes
Updated the
isGpuInstanceType()function inpackages/aws-cdk-lib/aws-eks/lib/private/nodegroup.tsto recognize 6 additional GPU and accelerator instance types:Trainium Instances (AWS Neuron accelerators):
P5 Series GPUs (NVIDIA H100):
The implementation adds these instance classes to the existing
knownGpuInstanceTypesarray, maintaining logical grouping (P-series together, TRN-series as new group). This enables proper AMI type selection (AL2023_X86_64_NEURON for Trainium, AL2023_X86_64_NVIDIA for P5).Before:
After:
Describe any new or updated permissions being added
N/A - This change only updates client-side validation logic. No IAM permissions or resource access changes.
Description of how you validated changes
nodegroup.test.ts. All 365 unit tests pass with zero regressions.integ.eks-inference-nodegroup.ts) demonstrate the correct usage for accelerator instances.Checklist
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license