[KEP-5710]: Workload-aware preemption KEP #5711

wojtek-t · 2025-11-28T14:26:13Z

One-line PR description: First draft of Workload-aware preemption KEP
Issue link: Workload-aware preemption #5710

k8s-ci-robot · 2025-11-28T14:26:23Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: wojtek-t
Once this PR has been reviewed and has the lgtm label, please assign sanposhiho for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/sig-scheduling/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

wojtek-t · 2025-11-28T14:26:33Z

@dom4ha @sanposhiho @macsko @erictune

44past4 · 2025-12-01T09:21:16Z

keps/sig-scheduling/5710-workload-aware-preemption/README.md

+1. Identify the list of potential victims:
+   - all running workloads with (preemption) priority lower than the new workload W
+   - all individual pods (not being part of workloads) with priority lower than the new workload W
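The victim-selection step quoted above can be sketched roughly as follows. This is only an illustration, not the KEP's implementation: the `Workload` and `Pod` types, their field names, and the choice to compare a victim's preemption priority against the scheduling priority of W are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Pod:
    name: str
    priority: int
    workload: Optional[str] = None  # None: the pod is not part of any workload


@dataclass(frozen=True)
class Workload:
    name: str
    scheduling_priority: int   # hypothetical field: used when the workload is the preemptor
    preemption_priority: int   # hypothetical field: used when the workload is a candidate victim


def potential_victims(
    new: Workload, running: List[Workload], pods: List[Pod]
) -> Tuple[List[Workload], List[Pod]]:
    """Return running workloads and standalone pods that rank below `new`."""
    # Assumption: a running workload is a candidate if its preemption
    # priority is lower than the scheduling priority of the new workload.
    victim_workloads = [
        w for w in running if w.preemption_priority < new.scheduling_priority
    ]
    # Only pods that do not belong to a workload are considered individually.
    victim_pods = [
        p for p in pods
        if p.workload is None and p.priority < new.scheduling_priority
    ]
    return victim_workloads, victim_pods


w_new = Workload("W", scheduling_priority=50, preemption_priority=50)
running = [
    Workload("low", 10, 10),
    Workload("high", 100, 100),
    Workload("high-sched-low-preempt", 100, 10),
]
pods = [Pod("solo-low", 5), Pod("solo-high", 90), Pod("member", 1, workload="low")]
victim_workloads, victim_pods = potential_victims(w_new, running, pods)
```

In this sketch the pod `member` is skipped as an individual victim because it belongs to a workload and would only be preempted together with it.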


Having two independent priorities for a workload (one for scheduling and one for preemption), or a single preemption priority that can be dynamically updated, can potentially lead to a preemption cycle.

Let's assume that we have an existing workload A with high scheduling priority and low preemption priority running in a cluster.

Now let's assume that we want to schedule a workload B which has medium scheduling priority and medium preemption priority.

Workload B will preempt workload A and start to run, because its scheduling priority > the preemption priority of workload A.

However, when workload A restarts and is rescheduled, it will preempt workload B and start to run, because its scheduling priority > the preemption priority of workload B.
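The A/B scenario above can be reproduced with a small simulation. This is a minimal sketch under stated assumptions: the numeric values (100 for high, 50 for medium, 10 for low), the `Workload` type, and the rule "the incoming workload preempts when its scheduling priority is greater than the running workload's preemption priority" are illustrative and not taken from the KEP.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Workload:
    name: str
    scheduling_priority: int
    preemption_priority: int


def can_preempt(incoming: Workload, running: Workload) -> bool:
    # Assumed rule: compare the preemptor's scheduling priority
    # with the victim's preemption priority.
    return incoming.scheduling_priority > running.preemption_priority


def simulate(running: Workload, incoming: Workload, rounds: int) -> List[str]:
    """Let the preempted workload come back and retry, up to `rounds` times."""
    history = []
    for _ in range(rounds):
        if not can_preempt(incoming, running):
            break
        history.append(f"{incoming.name} preempts {running.name}")
        # The victim restarts and becomes the next incoming workload.
        running, incoming = incoming, running
    return history


a = Workload("A", scheduling_priority=100, preemption_priority=10)  # high / low
b = Workload("B", scheduling_priority=50, preemption_priority=50)   # medium / medium
history = simulate(running=a, incoming=b, rounds=4)
```

With these values the loop never stops on its own: every round ends in a preemption, and only the `rounds` limit terminates it.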

The same issue can happen if we have only one priority but that priority is reduced while the workload is running. After preemption, when the workload reappears with its original, higher priority, it can preempt the workload that preempted it.

One potential solution / mitigation to the described problem could be to require that preemption priority >= scheduling priority. This way, after restarting, the preempted workload will not be able to preempt the preemptor workload.
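A quick brute-force check of this constraint, as a sketch only: it covers pairs of workloads with static priorities drawn from a small hypothetical range, and it assumes the same comparison rule as the scenario above (preemptor's scheduling priority > victim's preemption priority). The function names are made up for the example.

```python
from itertools import product


def is_valid(scheduling: int, preemption: int) -> bool:
    # Proposed constraint: preemption priority >= scheduling priority.
    return preemption >= scheduling


def can_preempt(incoming_scheduling: int, running_preemption: int) -> bool:
    return incoming_scheduling > running_preemption


def mutual_preemption_pairs(levels, enforce_constraint: bool):
    """List (sched_a, preempt_a, sched_b, preempt_b) where A and B can preempt each other."""
    pairs = []
    for sa, pa, sb, pb in product(levels, repeat=4):
        if enforce_constraint and not (is_valid(sa, pa) and is_valid(sb, pb)):
            continue
        if can_preempt(sb, pa) and can_preempt(sa, pb):
            pairs.append((sa, pa, sb, pb))
    return pairs


levels = range(5)  # hypothetical priority levels 0..4
with_constraint = mutual_preemption_pairs(levels, enforce_constraint=True)
without_constraint = mutual_preemption_pairs(levels, enforce_constraint=False)
```

With the constraint enforced, no pair can preempt each other: A preempting B would require A's scheduling priority > B's preemption priority >= B's scheduling priority, and the symmetric condition for B contradicts it. Without the constraint, such pairs exist, for example `(4, 0, 2, 2)`.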

Thanks for pointing that out!

Yeah - "preemption priority >= scheduling priority" is definitely desired. I don't think we have any use cases that would benefit from the reverse.

That said, I need to think a bit more about whether that is enough. I think it prevents cycles if we assume static priorities, but it can still potentially trigger cycles if the priorities change. OTOH, if the priorities are changing, this is probably desired.

Let me think about it a bit more and I will update the KEP to reflect the thoughts later this week.

sanposhiho · 2025-12-01T13:28:35Z

/assign

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Nov 28, 2025

k8s-ci-robot requested review from dom4ha and macsko November 28, 2025 14:26

k8s-ci-robot added kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. labels Nov 28, 2025

github-project-automation bot added this to SIG Scheduling Nov 28, 2025

k8s-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Nov 28, 2025

wojtek-t force-pushed the workload_aware_preemption branch from 672aa68 to ce04eca Compare December 1, 2025 08:21

Workload-aware preemption KEP

0ff3958

wojtek-t force-pushed the workload_aware_preemption branch from ce04eca to 0ff3958 Compare December 1, 2025 08:52

44past4 reviewed Dec 1, 2025

k8s-ci-robot assigned sanposhiho Dec 1, 2025
