cohort resources are getting over borrowed when multiple cluster queues are borrowing resources #5289

alaypatel07 · 2025-05-20T03:09:16Z

What happened:

A cohort was created for team-ab with 4 cpu and 4GiB memory
A cluster queue for team-a with nominal quota of 2 CPU and 2 GiB memory, with cohort of team-ab
A cluster queue for team-b with nominal quota of 2 CPU and 2 GiB memory, with cohort of team-ab
Two jobs were created in team-a namespace requesting total of 4 CPUs and 4GiB memory. This leads to borrowing from team-b quota
A job was created in team-b namespace requesting 4 CPU and 4GiB memory, this workload was admitted

What you expected to happen:

I expected the job in team-b to be in pending state because team-b doesnt have enough quota. Instead it was in running state.

How to reproduce it (as minimally and precisely as possible):

create a cohort

cat <<EOF | k apply -f -
apiVersion: kueue.x-k8s.io/v1alpha1
kind: Cohort
metadata:
  name: team-ab
spec:
  resourceGroups:
  - coveredResources: ["cpu", "memory"]
    flavors:
      - name: default-flavor
        resources:
          - name: cpu
            nominalQuota: 4  # Total CPU quota in the cohort (2 + 2)
          - name: memory
            nominalQuota: 4Gi  # Total memory quota in the cohort (2Gi + 2Gi)

create team-a, team-b clusterqueue and local queue

cat <<EOF | k apply -f -
apiVersion: kueue.x-k8s.io/v1beta1
kind: ClusterQueue
metadata:
  name: "team-a-cq"
spec:
  namespaceSelector: {} # match all
  cohort: "team-ab"     # Join this cohort for resource sharing
  resourceGroups:
  - coveredResources: ["cpu", "memory"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 2  # Guaranteed quota
      - name: "memory"
        nominalQuota: 2Gi
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: ClusterQueue
metadata:
  name: "team-b-cq"
spec:
  namespaceSelector: {} # match all
  cohort: "team-ab"     # Same cohort for resource sharing
  resourceGroups:
  - coveredResources: ["cpu", "memory"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 2
      - name: "memory"
        nominalQuota: 2Gi
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: LocalQueue
metadata:
  name: team-a-queue
  namespace: team-a
spec:
  clusterQueue: team-a-cq
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: LocalQueue
metadata:
  name: team-b-queue
  namespace: team-b
spec:
  clusterQueue: team-b-cq
EOF

create team-a workload

cat <<EOF | k apply -f -
apiVersion: batch/v1
kind: Job
metadata:
  name: team-a-job-0
  namespace: team-a
  annotations:
    kueue.x-k8s.io/queue-name: team-a-queue
    kueue.x-k8s.io/priority: "10" # Lower priority
spec:
  parallelism: 1
  completions: 1
  template:
    spec:
      containers:
      - name: worker
        image: busybox
        command: ["sleep", "3600"] # Job runs for 1 hour
        resources:
          requests:
            cpu: "2"     # Using all team-a CPU quota
            memory: "2Gi" # Using all team-a memory quota
      restartPolicy: Never
---
apiVersion: batch/v1
kind: Job
metadata:
  name: team-a-job-1
  namespace: team-a
  annotations:
    kueue.x-k8s.io/queue-name: team-a-queue
    kueue.x-k8s.io/priority: "10" # Lower priority
spec:
  parallelism: 1
  completions: 1
  template:
    spec:
      containers:
      - name: worker
        image: busybox
        command: ["sleep", "3600"] # Job runs for 1 hour
        resources:
          requests:
            cpu: "2"     # borrowing all team-b CPU quota
            memory: "2Gi" # borrowing all team-b memory quota
      restartPolicy: Never
EOF

create team-b job

$ cat <<EOF | k apply -f -
apiVersion: batch/v1
kind: Job
metadata:
  name: team-b-job
  namespace: team-b
  annotations:
    kueue.x-k8s.io/queue-name: team-b-queue
    kueue.x-k8s.io/priority: "10" # Lower priority
spec:
  parallelism: 1
  completions: 1
  template:
    spec:
      containers:
      - name: worker
        image: busybox
        command: ["sleep", "3600"] # Job runs for 1 hour
        resources:
          requests:
            cpu: "2"     # Using all team-b CPU quota
            memory: "2Gi" # Using all team-b memory quota
      restartPolicy: Never

Check for running jobs

$  kubectl get jobs -A
NAMESPACE   NAME           STATUS    COMPLETIONS   DURATION   AGE
team-a      team-a-job-0   Running   0/1           75s        75s
team-a      team-a-job-1   Running   0/1           75s        75s
team-b      team-b-job     Running   0/1           68s        68s

Anything else we need to know?:

Environment:

Kubernetes version (use kubectl version):
$ k version Client Version: v1.32.3 Kustomize Version: v5.5.0 Server Version: v1.32.0
Kueue version (use git describe --tags --dirty --always): v0.11.4
Cloud provider or hardware configuration: minikube
OS (e.g: cat /etc/os-release):
Kernel (e.g. uname -a):
Install tools:
Others:

The text was updated successfully, but these errors were encountered:

mimowo · 2025-05-20T05:05:50Z

cc @gabesaba ptal

gabesaba · 2025-05-20T08:56:49Z

Hi @alaypatel07, this quota defined at Cohort level is additive: so there is a total of 8CPU/8Gi available in the entire Cohort

gabesaba · 2025-05-20T09:02:05Z

I will update the docs to make this more clear - this is not the first time a user expected this semantic. @alaypatel07, were there any docs in particular which gave you the impression that the resources defined at Cohort level worked in this way?

alaypatel07 · 2025-05-20T13:41:00Z

@gabesaba I was reading from this doc https://kueue.sigs.k8s.io/docs/concepts/cohort/#configuring-quotas, I dont see it being mentioned anywhere that quotas on cohorts are additive.

Can you please be more clear on what additive means? If there are 4 cluster queues belonging to a cohort and the cohort defines nominalquota of 2 CPU, then in total there will be quota of 10 CPUs, 2 for each clusterqueue?

gabesaba · 2025-05-20T14:00:06Z

@gabesaba I was reading from this doc https://kueue.sigs.k8s.io/docs/concepts/cohort/#configuring-quotas, I dont see it being mentioned anywhere that quotas on cohorts are additive.

Can you please be more clear on what additive means? If there are 4 cluster queues belonging to a cohort and the cohort defines nominalquota of 2 CPU, then in total there will be quota of 10 CPUs, 2 for each clusterqueue?

In that case, there will just be 2CPU quota, assuming that the ClusterQueues do not define any quota. I just meant that the Resources defined at the Cohort level is independent of quotas at ClusterQueue. These numbers may be added up to determine total capacity. E.g.:

Structure

Cohort (1gb memory)
- CQ (1 CPU, 1gb memory)

Total Resources Available in Cohort

1CPU, 2gb memory

alaypatel07 · 2025-05-20T14:11:25Z

Ohh I see, I think I had a different mental model of the system. I assumed that quota needs to be defined at cohort once and then ClusterQueues can take smaller slices of resources from the quota at cohort level. This is clearly not true.

Can you please help put this in documentation? I will be happy to review the doc PR.

alaypatel07 added the kind/bug Categorizes issue or PR as related to a bug. label May 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

cohort resources are getting over borrowed when multiple cluster queues are borrowing resources #5289

cohort resources are getting over borrowed when multiple cluster queues are borrowing resources #5289

alaypatel07 commented May 20, 2025

mimowo commented May 20, 2025

Uh oh!

gabesaba commented May 20, 2025

Uh oh!

gabesaba commented May 20, 2025

Uh oh!

alaypatel07 commented May 20, 2025

Uh oh!

gabesaba commented May 20, 2025

Uh oh!

alaypatel07 commented May 20, 2025

Uh oh!

cohort resources are getting over borrowed when multiple cluster queues are borrowing resources #5289

cohort resources are getting over borrowed when multiple cluster queues are borrowing resources #5289

Comments

alaypatel07 commented May 20, 2025

mimowo commented May 20, 2025

Uh oh!

gabesaba commented May 20, 2025

Uh oh!

gabesaba commented May 20, 2025

Uh oh!

alaypatel07 commented May 20, 2025

Uh oh!

gabesaba commented May 20, 2025

Uh oh!

alaypatel07 commented May 20, 2025

Uh oh!