
ania-borowiec
Contributor

@ania-borowiec ania-borowiec commented Oct 6, 2025

  • One-line PR description: Narrow down the scope of the feature, allowing it to move to beta
  • Other comments:

@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory labels Oct 6, 2025
@k8s-ci-robot k8s-ci-robot requested review from dom4ha and macsko October 6, 2025 10:09
@k8s-ci-robot k8s-ci-robot added the sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. label Oct 6, 2025
@k8s-ci-robot k8s-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Oct 6, 2025
@ania-borowiec
Contributor Author

/cc @dom4ha @sanposhiho @macsko

@macsko
Member

macsko commented Oct 6, 2025

/hold
Waiting for the rest of updates for v1.35

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 6, 2025
@k8s-ci-robot k8s-ci-robot added size/S Denotes a PR that changes 10-29 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Oct 7, 2025
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Oct 9, 2025
@ania-borowiec ania-borowiec changed the title KEP-5278: update PRR review questionnaire for move to beta KEP-5278: update KEP for NominatedNodeName, narrowing down the scope of the feature and moving it to beta Oct 10, 2025
Member

@stlaz stlaz left a comment

PRR shadow review

simply be rejected by the scheduler (and the `NominatedNodeName` will be cleared before
moving the rejected pod to unschedulable).

#### Increasing the load to kube-apiserver
Member

I don't understand scheduling all that well so correct me if my assumptions are incorrect.

Is the assumption here that if PreBind() plugins skip, the binding operation will never take too much time and so we don't need to expose NominatedNodeName? This is important for the KEP to not be in conflict with "User story 1".

It might be worth noting that updating NominatedNodeName for every pod would only double the API requests per pod in the happy path. If I understand the docs at https://kubernetes.io/docs/concepts/scheduling-eviction/scheduling-framework/#pre-bind correctly, if there were some nodes that look good from a scheduling perspective but often cause the prebind plugins to fail, that might increase the API requests N times, where N>=2.

Contributor Author

@ania-borowiec ania-borowiec Oct 13, 2025

Correct, the assumption is that all tasks related to binding that may take long to complete (e.g. creating volumes, attaching DRA devices) are executed in PreBind(), and Bind() should not take too much time.

As far as I know, increasing the number of API requests by 2x is not acceptable: it would happen for every pod being scheduled (or re-scheduled), so it would add up to a huge number.
Also, adding an extra API call before binding makes the entire procedure a bit longer and the scheduling throughput a bit lower - so if we assume that Bind() will be quick, we should avoid that extra cost.
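For illustration, a minimal client-go sketch of the single extra status update per pod that this trade-off is about; the kubeconfig handling, namespace, pod name and node name are hypothetical, not taken from the KEP:

```go
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(config)

	// One strategic-merge patch against the status subresource - this is the
	// kind of "extra API call per scheduled pod" discussed above.
	patch := []byte(`{"status":{"nominatedNodeName":"node-a"}}`)
	pod, err := client.CoreV1().Pods("default").Patch(
		context.TODO(), "example-pod", types.StrategicMergePatchType,
		patch, metav1.PatchOptions{}, "status")
	if err != nil {
		panic(err)
	}
	fmt.Println("nominatedNodeName:", pod.Status.NominatedNodeName)
}
```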

The feature can be disabled in Beta version by restarting the kube-scheduler and kube-apiserver with the feature-gate off.

###### What happens if we reenable the feature if it was previously rolled back?

Member

In ###### Are there any tests for feature enablement/disablement?

This feature only changes when the NominatedNodeName field will be set - it doesn't introduce a new API.

Is that correct? NominatedNodeName sounds like a new field in the Pod API.

Member

In ###### Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?

We will do the following manual test after implementing the feature:

What was the result of the test?

Member

The NominatedNodeName field wasn't added by this KEP (it was added a long time ago). The KEP's purpose is to extend the usage of this field.

Contributor

The NominatedNodeName field wasn't added by this KEP (it was added a long time ago). The KEP's purpose is to extend the usage of this field.

The idea behind enablement/disablement tests is that depending on the FG the functionality is or is not working. So the question is more around ensuring that when you turn off the appropriate FG the functionality doesn't set NNN, or in case of kube-apiserver doesn't clear it, and vice versa when it's on. Especially at beta stage, where we need to ensure that users can safely turn off this on-by-default (beta) functionality.

Contributor

In ###### Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?

We will do the following manual test after implementing the feature:

As @stlaz pointed out, this description is required for beta promotion.

Contributor

Below in ##### What are the reasonable SLOs (Service Level Objectives) for the enhancement? I assume the functionality has already been tested; can you update that section with the throughput findings?

Contributor Author

@soltysh
Addressing only the comments about upgrade->downgrade->upgrade testing (I will address the rest later today):

The idea behind enablement/disablement tests is that depending on the FG the functionality is or is not working. So the question is more around ensuring that when you turn off the appropriate FG the functionality doesn't set NNN, or in case of kube-apiserver doesn't clear it, and vice versa when it's on. Especially at beta stage, where we need to ensure that users can safely turn off this on-by-default (beta) functionality.

Thank you for bringing my attention to this point, I missed this earlier when editing the KEP.
And now I'm not really sure what this test should look like.

So the trouble is, this is not a typical promotion from alpha to beta.

The scope of this KEP in alpha allowed NNN to be set by components other than kube-scheduler and established the semantics for this field. The designed behavior was that the NNN field would be cleared in some situations, but not after a failed scheduling attempt.

But after the v1.34 release there was a change of plans in sig-scheduling: with the idea of Gang Scheduling coming up (and, with that, ideas for new approaches to resource reservation), it seems that NNN might not be the mechanism we want to invest in right now as a means for other components to suggest pod placement to kube-scheduler.

At the same time, using NNN as "set by kube-scheduler, read-only in CA" seems like a good and worthwhile approach to solve the buggy scenario "if pod P is scheduled to bind on node N, but binding P takes a long time, and N is otherwise empty, CA might scale down N before P gets bound".
So the decision was to narrow down the scope of this KEP significantly and get it to beta.

Please note that before the alpha KEP the scheduler's code would clear NNN after a failed scheduling attempt. So what this hoping-to-be-beta KEP does vs pre-alpha is:

  • introduces setting NNN in the PreBinding phase, i.e. when the scheduler expects that the entire prebinding + binding process may take a significant amount of time (gated by nominatedNodeNameForExpectationEnabled)
  • makes kube-apiserver clear NNN when the pod gets bound (gated by ClearingNominatedNodeNameAfterBinding)

And what this beta-KEP does vs alpha-KEP is:

  • reverts the logic that does not clear NNN upon failed scheduling attempt (the logic was gated by nominatedNodeNameForExpectationEnabled)

With all that, in the beta KEP NNN should be set when a pod is either waiting for preemption to complete (which had been the case before the alpha KEP) or during the prebinding/binding phases, and it should be cleared by the api-server after binding.

Can you please help me with the following questions?

  1. Since the implementation has yet to change, is it ok if I run the test after implementing the new version?

  2. IIUC the upgrade->downgrade->upgrade test scenario should be as follows, can you verify that?
     1. upgrade
     2. request scheduling of a pod that will need a long preBinding phase
     3. check that NNN gets set for that pod
     4. before binding completes, restart the scheduler with nominatedNodeNameForExpectationEnabled = false
     5. check that the pod gets scheduled and bound successfully to the same node
     6. and upgrade again?
     Or:
     6a. request scheduling another pod with expected long preBind
     7a. check that NNN does not get set in PreBind
     8a. restart the scheduler with nominatedNodeNameForExpectationEnabled = true
     9a. check that the pod gets scheduled and bound somewhere
Thank you!
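For the "check that NNN gets set / does not get set" steps above, a rough way to read the field with client-go (the namespace and pod name are hypothetical):

```go
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(config)

	// Fetch the pod under test and print its nomination and binding status.
	pod, err := client.CoreV1().Pods("default").Get(context.TODO(), "slow-prebind-pod", metav1.GetOptions{})
	if err != nil {
		panic(err)
	}
	fmt.Printf("nominatedNodeName=%q nodeName=%q\n", pod.Status.NominatedNodeName, pod.Spec.NodeName)
}
```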

Contributor Author

@soltysh

Below in ##### What are the reasonable SLOs (Service Level Objectives) for the enhancement? I assume the functionality has already been tested; can you update that section with the throughput findings?

As it turns out, there are no tests running with this feature enabled (probably because the original plan was to launch it as beta in 1.34, where the FG would be on by default and all tests would run with it).
I can run perf tests now and update the numbers here, but since the implementation is going to change after the KEP update, perhaps I can run the tests on the actual implementation, when it's ready? WDYT?

Contributor

1. Since the implementation has yet to change, is it ok if I run the test after implementing the new version?

Yes, you'll want to update the doc afterwards.

2. IIUC the upgrade->downgrade->upgrade test scenario should be as follows, can you verify that?
   1. upgrade
   2. request scheduling of a pod that will need a long preBinding phase
   3. check that NNN gets set for that pod
   4. before binding completes, restart the scheduler with nominatedNodeNameForExpectationEnabled = false
   5. check that the pod gets scheduled and bound successfully to the same node
   6. and upgrade again?
   Or:
   6a. request scheduling another pod with expected long preBind
   7a. check that NNN does not get set in PreBind
   8a. restart the scheduler with nominatedNodeNameForExpectationEnabled = true
   9a. check that the pod gets scheduled and bound somewhere

Both options are fine by me. But it seems the versions with (a) will be easier to perform.

As it turns out, there are no tests running with this feature enabled (probably because the original plan was to launch it as beta in 1.34, where the FG would be on by default and all tests would run with it).
I can run perf tests now and update the numbers here, but since the implementation is going to change after the KEP update, perhaps I can run the tests on the actual implementation, when it's ready? WDYT?

Yes, it can be updated in followup.

Contributor Author

Thank you! I will make sure to update the doc with all the results

During the beta period, the feature gates `NominatedNodeNameForExpectation` and `ClearingNominatedNodeNameAfterBinding` are enabled by default; no action is needed.

**Downgrade**

Member

In ### Versions Skew Strategy:

What happens to the pods that already have the NominatedNodeName set in a cluster with kube-apiserver that does not understand that field?
What happens if a scheduler tries to set NominatedNodeName on all of its scheduled pods while contacting an older kube-apiserver that does not know the field?

These questions are related to rollout/rollback section of the PRR questionnaire.

Contributor Author

kube-apiserver has known this field for a long time, but does not interpret it - setting/using the NominatedNodeName field in components other than kube-scheduler is out of the scope of this KEP.

This field was introduced in 2018 (kubernetes/kubernetes@384a86c) - I assume that if we try using kube-apiserver from pre-2018 with kube-scheduler v1.35, this would cause way bigger problems than just trouble with handling NominatedNodeName.

Member

I didn't know we had the field for such a long time; we don't need to worry about it not being present, then 👍

The feature can be disabled in Beta version by restarting the kube-scheduler and kube-apiserver with the feature-gate off.

###### What happens if we reenable the feature if it was previously rolled back?

Member

The NominatedNodeName field wasn't added by this KEP (it was added a long time ago). The KEP's purpose is to extend the usage of this field.

@wojtek-t wojtek-t self-assigned this Oct 13, 2025
@wojtek-t
Member

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 13, 2025
Member

@dom4ha dom4ha left a comment

I believe it's important to cover DRA resource accounting as well, to completely address the scheduler->CA resource accounting problem for pods in delayed-binding cases.

@ania-borowiec Sorry that I suggested the wording of entire sections, but explaining what I suggest adding would be almost identical to what I wrote anyway. Feel free to reword and rearrange it.

@wojtek-t @sanposhiho @macsko

the cluster autoscaler cannot understand that the pod is going to be bound there,
misinterprets the node as low-utilized (because the scheduler keeps the place of the pod), and deletes the node.

We can expose those internal reservations with `NominatedNodeName` so that external components can take a more appropriate action
Member

You can add another paragraph about how DRA interacts with NNN, as it's important for scheduler->CA resource accounting:

Please note that NominatedNodeName can express a reservation of node resources only, but some resources can be managed by a DRA plugin and expressed in the form of a ResourceClaim allocation. To correctly account for all the resources that a pod needs, both the nomination and the ResourceClaim status update need to be reflected in the api-server.
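For readers less familiar with DRA, a tiny sketch of what "reflected in the api-server" means for accounting (assuming the resource.k8s.io/v1beta1 types; an illustration, not part of the suggested KEP text):

```go
package main

import (
	"fmt"

	resourceapi "k8s.io/api/resource/v1beta1"
)

// claimDevicesReserved reports whether the claim's devices should already be
// treated as in use by an external component such as Cluster Autoscaler.
// The allocation result alone reserves the devices; the consuming pod's
// spec.nodeName or status.nominatedNodeName does not matter for this check.
func claimDevicesReserved(claim *resourceapi.ResourceClaim) bool {
	return claim.Status.Allocation != nil
}

func main() {
	claim := &resourceapi.ResourceClaim{}
	fmt.Println("devices reserved:", claimDevicesReserved(claim))
}
```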

Member

@x13n Can you confirm my understanding for the Cluster Autoscaler part? Would the DRA resources be correctly accounted as in-use as soon as the ResourceClaim allocation is reflected in the api-server?

IIUC the accounting of a ResourceClaim allocation does not depend on the pod that uses it being either bound or having an NNN nomination.

Member

Yes, that's my understanding, though @towca please keep me honest here.


I'm not sure I get the question, but I would love for this KEP to clarify how Cluster Autoscaler should interact with Pods that have nominatedNodeName set.

The current behavior is that if CA sees nominatedNodeName on a Pod, it adds the Pod onto the nominated Node in its simulation, without checking kube-scheduler Filters. In particular, if the preempted Pod(s) are still on the Node, the Node is effectively "overscheduled" in CA simulations. This ensures that CA doesn't trigger an unnecessary scale-up for the preemptor Pod.

The above logic worked well enough before DRA, but it's not correct for Pods that reference ResourceClaims. nominatedNodeName can be set before the ResourceClaims are allocated, and CA won't allocate the claims in its simulation because it's not running scheduler Filters before adding the Pod to the Node. So when that happens, the DRA Devices needed by the claims are effectively free to be taken by other Pods in CA simulations. If there are other pending Pods referencing claims that can be satisfied by these Devices, CA will not scale-up for them until the preemption is completed. I described an example problematic scenario in detail here: kubernetes/autoscaler#7683 (comment).

CA could start running the Filters for Pods with nominatedNodeName set, but then if the preemption isn't completed yet they won't pass and the Pod won't fit - which will trigger an unnecessary scale-up. So this also doesn't sound like a good option.

This wouldn't be an issue if kube-scheduler persisted the claim allocations in the API before setting nominatedNodeName. But if some claims need to be deallocated as part of the preemption, that doesn't seem possible.

WDYT we should do here? It doesn't have to be a part of this KEP of course, but we'll need to figure it out at some point (kubernetes/autoscaler#7683).
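To make the trade-off above concrete, here is a schematic sketch of the two simulation strategies; this is not Cluster Autoscaler's actual code, and all type and function names are invented:

```go
package main

import v1 "k8s.io/api/core/v1"

// simulatedNode is a stand-in for a node in the autoscaler's in-memory snapshot.
type simulatedNode struct {
	name string
	pods []*v1.Pod
}

// runSchedulerFilters is a placeholder for running kube-scheduler Filter plugins
// (including the DRA plugin, which would also allocate ResourceClaims in the snapshot).
func runSchedulerFilters(pod *v1.Pod, node *simulatedNode) bool { return false }

// placeNominatedPod illustrates the dilemma: the current behavior force-adds the
// pod to its nominated node without Filters (so ResourceClaims are never allocated
// in the simulation), while the alternative runs Filters and may fail while the
// preemption is still in progress, triggering an unnecessary scale-up.
func placeNominatedPod(pod *v1.Pod, nodes map[string]*simulatedNode, runFilters bool) bool {
	node, ok := nodes[pod.Status.NominatedNodeName]
	if !ok {
		return false
	}
	if runFilters && !runSchedulerFilters(pod, node) {
		return false // treated as unschedulable -> CA may scale up unnecessarily
	}
	node.pods = append(node.pods, pod) // current behavior: node may be "overscheduled"
	return true
}

func main() {}
```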

Member

This wouldn't be an issue if kube-scheduler persisted the claim allocations in the API before setting nominatedNodeName.

Currently the order is different, but in a follow-up we can change it, as we need to export the ResourceClaim allocation before WaitOnPermit anyway. That would require introducing a new "Reserve in api-server" phase in the scheduler, so currently plugins cannot do that earlier than in PreBind.

Is the order that important though? CA would get both updates, so eventually its state should be consistent.

As discussed at [Confusion if `NominatedNodeName` is different from `NodeName` after all](#confusion-if-nominatednodename-is-different-from-nodename-after-all),
we update kube-apiserver so that it clears `NominatedNodeName` when receiving binding requests.

Member

@dom4ha dom4ha Oct 13, 2025

Handling ResourceClaim status updates

Since ResourceClaim status updates are complementary to node nomination (they reserve resources in a similar way), it is desirable that they be set at the beginning of the PreBinding phase (before it starts waiting). The order of actions in the devicemanagement plugin is correct; however, the scheduler performs the binding actions of different plugins sequentially, so, for instance, a long-lasting PVC provisioning may delay exporting the ResourceClaim allocation status. This is not desirable, as it leaves a window of not-yet-reserved DRA resources, causing problems similar to the ones originally fixed by this KEP - kubernetes/kubernetes#125491

Member

FYI @x13n

Member

Would executing these binding plugins in parallel (not necessarily here - I'm thinking of a separate effort) be a feasible improvement? Are there going to be independent long-running allocations, or is it just slow PVC provisioning? In the latter case, would it suffice to ensure the PVC plugin is executed last?

Member

I don't see a reason why PVC provisioning should delay DRA provisioning and vice versa, so I'd say it should be a pretty straightforward (at least theoretically) change. Yes, ensuring the right order is the minimum.

Regarding exporting the ResourceClaim allocation, we most likely need to move it into a new phase anyway, since pre-binding currently happens after WaitOnPermit, but the node nomination is exported before it. We would most likely want to make those two mechanisms consistent and more tightly coupled, but that requires a somewhat wider agreement and common understanding among different components - not only CA but also kubelets.
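As a rough sketch of the parallel-execution idea (not the scheduler framework's actual code; both operations below are hypothetical stand-ins):

```go
package main

import (
	"context"
	"fmt"

	"golang.org/x/sync/errgroup"
)

// Stand-ins for two independent pieces of binding-phase work.
func provisionVolumes(ctx context.Context) error      { return nil } // slow PVC provisioning
func exportClaimAllocation(ctx context.Context) error { return nil } // DRA ResourceClaim status update

func main() {
	g, ctx := errgroup.WithContext(context.Background())
	// Neither operation depends on the other, so slow PVC provisioning no longer
	// delays publishing the ResourceClaim allocation.
	g.Go(func() error { return exportClaimAllocation(ctx) })
	g.Go(func() error { return provisionVolumes(ctx) })
	if err := g.Wait(); err != nil {
		fmt.Println("binding-phase work failed:", err)
		return
	}
	fmt.Println("binding-phase work finished")
}
```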

Contributor

@soltysh soltysh left a comment

The main bits missing for PRR are tests:

  • units - minor updates
  • integration - missing links
  • enable/disable feature gate
  • upgrade->downgrade tests

Additionally, an updated SLO (based on tests in alpha) would be nice, but most important is a description of the read-only nature of the NNN field and its impact.

Also, with [scheduler-perf](https://github.com/kubernetes/kubernetes/tree/master/test/integration/scheduler_perf), we'll make sure the scheduling throughputs for pods that go through Permit or PreBind don't regress too much.
We need to accept a small regression to some extent since there'll be a new API call to set NominatedNodeName.
But, as discussed, assuming PreBind already makes some API calls for the pods, the regression there should be small.
Contributor

I assume the intention to add these tests has materialized in alpha - ideally all of them, but hopefully at least most 😅 Can you please link them per the template?

Contributor

Especially, as mentioned below, all functionality will be covered with integration tests.

Contributor Author

Updated


#### Allow NominatedNodeName to be set by other components

In v1.35 this feature is being narrowed down to one-way communication: only kube-scheduler is allowed to set `NominatedNodeName`,
while for other components this field should be read-only.
Contributor

Given this is one of the main goals for 1.35, I haven't seen anywhere in the document a description of what happens if another actor sets this field. Alternatively, how can you ensure inside kube-apiserver that this field is not set by external actors?

Contributor Author

Sure, I added a section about this in Risks and mitigations

@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 15, 2025
Contributor

@soltysh soltysh left a comment

So with the narrowed scope there are several things we need to agree on:

  1. Some tests (FG on/off, upgrade->downgrade and performance) can wait until after implementation.
  2. My question regarding [other actors acting on NNN and how this will be handled in the updated version] is a must. Furthermore, I'd like to see a description of how rollback to previous versions (pre-1.35, with the new, narrower functionality) will impact the system. I'm especially interested in identifying and eliminating/warning cluster admins about erroneous situations arising from an in-progress upgrade/downgrade where one component runs the older (more feature-rich) solution while another runs the new version (with narrowed functionality). I don't think such a risk exists, but maybe I'm missing something?


@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Oct 16, 2025
Contributor

@soltysh soltysh left a comment

/approve
the PRR section

@dom4ha
Member

dom4ha commented Oct 16, 2025

Great! Thanks Ania, it should mitigate the important race between the scheduler and CA. We should soon have proposals for how to expand this feature further.

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added lgtm "Looks good to me", indicates that a PR is ready to be merged. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Oct 16, 2025
@macsko
Member

macsko commented Oct 16, 2025

/approve

/unhold

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 16, 2025
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ania-borowiec, dom4ha, macsko, soltysh


@k8s-ci-robot k8s-ci-robot merged commit 062c236 into kubernetes:master Oct 16, 2025
4 checks passed
@k8s-ci-robot k8s-ci-robot added this to the v1.35 milestone Oct 16, 2025
@github-project-automation github-project-automation bot moved this from In Progress to Done in SIG Scheduling Oct 16, 2025