Skip to content

Workloads requesting TAS cannot run via MultiKueue #5313

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
mimowo opened this issue May 22, 2025 · 5 comments · May be fixed by #5361
Open

Workloads requesting TAS cannot run via MultiKueue #5313

mimowo opened this issue May 22, 2025 · 5 comments · May be fixed by #5361
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@mimowo
Copy link
Contributor

mimowo commented May 22, 2025

What happened:

We cannot run workloads requesting TAS via MultiKueue. This is problematic, because TAS workloads have dedicated annotations like "preferred: rack". However workloads with such annotations only match TAS ResourceFlavors.

What you expected to happen:

We can configure Kueue to schedule workloads requesting TAS via MultiKueue. They should execute on the workers, but we should have a way to run them.

The natural approach is to configure TAS RF on the management cluster (both on manager and worker).

How to reproduce it (as minimally and precisely as possible):

  1. Setup MultiKueue and configure CQ using TAS ResourceFlavor. The CQ will get deactivated immediately because of the soft validation we have:
    if len(c.multiKueueAdmissionChecks) > 1 {
    reasons = append(reasons, kueue.ClusterQueueActiveReasonMultipleMultiKueueAdmissionChecks)
    messages = append(messages, fmt.Sprintf("Cannot use multiple MultiKueue AdmissionChecks on the same ClusterQueue, found: %v", stringsutils.Join(c.multiKueueAdmissionChecks, ",")))
    }
@mimowo mimowo added the kind/bug Categorizes issue or PR as related to a bug. label May 22, 2025
@mimowo
Copy link
Contributor Author

mimowo commented May 22, 2025

/assign @IrvingMg
who already started to look into running TAS workloads over MultiKueue and hit the issue.

cc @mwysokin @mwielgus @tenzen-y @mszadkow

@kerthcet
Copy link
Contributor

Hey, what's the current status with multiKueue + TAS? No updates on website I believe.

@tenzen-y
Copy link
Member

Hey, what's the current status with multiKueue + TAS? No updates on website I believe.

As described in this issue, TAS+MultiKueue does not work well even though you use Kueue v0.12.0.

@tenzen-y
Copy link
Member

As a feature stage, TAS is alpha, and MultiKueue is Beta. However, we do not have any feature stage against that combination.

@kerthcet
Copy link
Contributor

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants