Prepare code for two-level scheduling #5469
Conversation
Hi @lchrzaszcz. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with `/ok-to-test`. Once the patch is verified, the new status will be reflected by the `ok-to-test` label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
Force-pushed from 9f737a5 to f9af513.
```diff
 bestFitIdx := 0
 for i, domain := range domains {
-	if domain.state >= count && domain.state != domains[bestFitIdx].state {
+	if domain.state >= count && domain.state < domains[bestFitIdx].state {
```
The previous code relied on the fact that domains are in descending order, so "!=" works perfectly for finding the tightest fit. I'm just changing it to "<" to make the intent explicit.
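For context, here is a minimal, self-contained sketch of the best-fit selection being discussed; the `domain` struct, its fields, and `findBestFit` are simplified assumptions for illustration, not the actual Kueue types:

```go
package main

import "fmt"

// domain is a simplified stand-in for the snapshot's domain type;
// state holds the domain's remaining capacity.
type domain struct {
	name  string
	state int32
}

// findBestFit returns the index of the smallest domain that still fits
// count, assuming domains are sorted in descending order of state.
func findBestFit(domains []domain, count int32) int {
	bestFitIdx := 0
	for i, d := range domains {
		// With a descending sort, "!=" would also land on the tighter
		// fit, but "<" states the intent without relying on the order.
		if d.state >= count && d.state < domains[bestFitIdx].state {
			bestFitIdx = i
		}
	}
	return bestFitIdx
}

func main() {
	domains := []domain{{"rack-a", 8}, {"rack-b", 5}, {"rack-c", 2}}
	fmt.Println(domains[findBestFit(domains, 4)].name) // rack-b: tightest fit for 4
}
```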
pkg/cache/tas_flavor_snapshot.go (Outdated)
```go
if a.state == b.state {
	return slices.Compare(a.levelValues, b.levelValues)
}
```
I'm not convinced I like the proposed abstraction - I would prefer to avoid duplicating this.
From the perspective of this PR it seems redundant, but looking at the introduction of chunks, there will be more custom logic in both modes.

What we could do is something like this:

```go
if a.state == b.state {
	return slices.Compare(a.levelValues, b.levelValues)
}
if useLeastFreeCapacityAlgorithm(unconstrained) {
	// ascending order
	return cmp.Compare(a.state, b.state)
} else {
	// descending order
	return cmp.Compare(b.state, a.state)
}
```

What do you think?
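To show how such a comparator would be wired in, here is a self-contained sketch using `slices.SortFunc`; the `domain` type and the `useLeastFreeCapacityAlgorithm` predicate are simplified assumptions, not the actual Kueue code:

```go
package main

import (
	"cmp"
	"fmt"
	"slices"
)

// domain is a simplified stand-in for the snapshot's domain type.
type domain struct {
	levelValues []string
	state       int32
}

// useLeastFreeCapacityAlgorithm is a hypothetical predicate mirroring
// the one proposed above; assume unconstrained selects LeastFreeCapacity.
func useLeastFreeCapacityAlgorithm(unconstrained bool) bool {
	return unconstrained
}

func sortDomains(domains []domain, unconstrained bool) {
	slices.SortFunc(domains, func(a, b domain) int {
		// Equal capacities tie-break deterministically on level values.
		if a.state == b.state {
			return slices.Compare(a.levelValues, b.levelValues)
		}
		if useLeastFreeCapacityAlgorithm(unconstrained) {
			return cmp.Compare(a.state, b.state) // ascending
		}
		return cmp.Compare(b.state, a.state) // descending
	})
}

func main() {
	ds := []domain{{[]string{"r1"}, 3}, {[]string{"r2"}, 7}, {[]string{"r3"}, 3}}
	sortDomains(ds, false)
	fmt.Println(ds) // descending by state: r2(7), then r1(3) before r3(3)
}
```

Keeping the tie-break shared and branching only on the sort direction is what avoids duplicating the comparator between the two modes.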
lgtm
OK, I think the fact that we are going to update this logic while working on two-level scheduling is motivation to avoid duplication, and thus avoid diverging the code further in the future.
Got it. Well, I've just realized that with the suggested solution I'm reinventing the wheel: the code looked like that before. So I'm reverting my proposed change, and I'll revert the similar change in the two-level scheduling PR.
```diff
 }
 results = append(results, sortedDomain[idx+offset])
-remainingCount -= sortedDomain[idx].state
+remainingCount -= sortedDomain[idx+offset].state
```
The code should subtract the chosen domain's state from remainingCount. The old code subtracted not the chosen one but the next in line. This is the same for all domains apart from the last one in BestFit. The old code worked OK because remainingCount is a local variable and we only care that it is not greater than 0 at the end of the function, so even if it dropped below 0 it was fine.
I'm fixing it to account for the optimized last domain correctly.
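A minimal sketch of the corrected accounting; the function name, the look-ahead, and the `domain` type are simplified assumptions standing in for the actual BestFit logic:

```go
// domain is the same simplified stand-in used in the sketches above.
type domain struct {
	name  string
	state int32
}

// chooseDomains greedily takes domains in descending order of capacity
// until count is covered. Once the remainder fits in a single domain,
// it scans ahead for the tightest one (the "optimized last domain"),
// tracked via offset.
func chooseDomains(sortedDomain []domain, count int32) []domain {
	var results []domain
	remainingCount := count
	for idx := 0; idx < len(sortedDomain) && remainingCount > 0; idx++ {
		offset := 0
		if sortedDomain[idx].state >= remainingCount {
			// Look ahead for the smallest domain that still fits.
			for j := idx + 1; j < len(sortedDomain) && sortedDomain[j].state >= remainingCount; j++ {
				offset = j - idx
			}
		}
		results = append(results, sortedDomain[idx+offset])
		// The fix: subtract the state of the domain actually chosen
		// (idx+offset), not the one at idx; they differ when offset != 0.
		remainingCount -= sortedDomain[idx+offset].state
	}
	return results
}
```

With the old subtraction the loop still terminated correctly, since the final remainingCount only needs to be non-positive, but the per-iteration value could be wrong for the last pick.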
gabesaba left a comment
/lgtm
will leave approval for @mimowo
LGTM label has been added. Git tree hash: ffdf58608c9950c875679e783bfb37fd3c9f0c05
I would like to wait for #5469 (comment)
Force-pushed from f9af513 to 35f49cc.
mimowo left a comment
/lgtm
/approve
Thanks 👍
LGTM label has been added. Git tree hash: 0d6e3978a71601639d29505149d1321cdefc15f2
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: lchrzaszcz, mimowo. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
Done. I've reverted the code, so it looks like it did before my changes.

/ok-to-test

/unhold
What type of PR is this?
/kind cleanup
What this PR does / why we need it:
This PR introduces some preparatory changes that do not change the logic of the code but make it easier to review https://github.com/kubernetes-sigs/kueue/pull/5353/files, which is TAS two-level scheduling.
Relates to: #5439
Preparation PR for: https://github.com/kubernetes-sigs/kueue/pull/5353/files
Which issue(s) this PR fixes:
Special notes for your reviewer:
Does this PR introduce a user-facing change?