
Policy never evicting pods, despite finding fits #1627


Open

jekawo opened this issue Feb 7, 2025 · 4 comments
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

jekawo commented Feb 7, 2025

What version of descheduler are you using?

descheduler version:
0.32.1
Helm chart:

Does this issue reproduce with the latest release?
on latest

Which descheduler CLI options are you using?

Please provide a copy of your descheduler policy config file

kind: Deployment

# labels that'll be applied to all resources
commonLabels: { "Descheduler" }

# Required when running as a Deployment
deschedulingInterval: 15m

cmdOptions:
  v: 5

evictionFailureEventNotification : true

# deschedulerPolicy contains the policies the descheduler will execute.
# To use policies stored in an existing configMap use:
# NOTE: The name of the cm should comply to {{ template "descheduler.fullname" . }}
deschedulerPolicy:
  nodeSelector: "agentpool=nodepool1"
  profiles:
    - name: default
      pluginConfig:
        - name: DefaultEvictor
          nodeFit: true
        - name: RemovePodsViolatingNodeAffinity
          args:
            nodeAffinityType:
              - requiredDuringSchedulingIgnoredDuringExecution
              - preferredDuringSchedulingIgnoredDuringExecution
            namespaces:
              include:
                - staging
                #- production
        - name: RemovePodsViolatingNodeTaints
          args:
            namespaces:
              include:
                - staging
                #- production
        - name: RemovePodsViolatingInterPodAntiAffinity
          args:
            namespaces:
              include:
                - development
                - staging
                #- production
      plugins:
        deschedule:
          enabled:
            - RemovePodsViolatingNodeTaints
            - RemovePodsViolatingNodeAffinity
            - RemovePodsViolatingInterPodAntiAffinity

What k8s version are you using (kubectl version)?

kubectl version Output
$ kubectl version
Client Version: v1.30.5
Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
Server Version: v1.31.3

What did you do?


Deployed descheduler with the above config, expecting pods to be evicted from nodepool1 if they would fit better in staging at the current point in time.

The logs acknowledge that the staging nodes fit, so the pods should be evicted.

However, the descheduler continuously reports 0 evictions and 0 attempts, and it does not report an eviction error either.

I added the pod disruption budget rules to the cluster role manually to get the current version working.
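
For reference, such a rule would look roughly like this (a sketch; the ClusterRole itself comes from the Helm chart, and the exact verbs may differ from what was actually added):

rules:
  # existing chart-generated rules stay as-is; this extra rule grants read access to PDBs
  - apiGroups: ["policy"]
    resources: ["poddisruptionbudgets"]
    verbs: ["get", "list", "watch"]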

What did you expect to see?

Pods evicted if they could fit on tolerated nodes.

What did you see instead?

0 pods evicted ever.

@jekawo jekawo added the kind/bug Categorizes issue or PR as related to a bug. label Feb 7, 2025
@AlexGurtoff

We are experiencing the same issue. We have the following ConfigMap:

apiVersion: "descheduler/v1alpha2"
kind: "DeschedulerPolicy"
profiles:
- name: default-profile
  pluginConfig:
  - args:
      nodeAffinityType:
      - preferredDuringSchedulingIgnoredDuringExecution
    name: RemovePodsViolatingNodeAffinity
  plugins:
    deschedule:
      enabled:
      - RemovePodsViolatingNodeAffinity

Also, we have a pod deployed onto a node with the following nodeAffinity:

  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 100
          preference:
            matchExpressions:
              - key: cloud.google.com/gke-spot
                operator: Exists

And there is a second node which is "better" (it has the cloud.google.com/gke-spot label), but the descheduler does nothing:

I0220 12:29:51.668857 1 descheduler.go:173] Setting up the pod evictor
I0220 12:29:51.668964 1 node_affinity.go:81] "Executing for nodeAffinityType" nodeAffinity="preferredDuringSchedulingIgnoredDuringExecution"
...
I0220 12:29:51.680954 1 node_affinity.go:121] "Processing node" node="some_node_name"
...
I0220 12:29:51.687967 1 profile.go:317] "Total number of pods evicted" extension point="Deschedule" evictedPods=0
I0220 12:29:51.688006 1 descheduler.go:179] "Number of evicted pods" totalEvicted=0

@AlexGurtoff

I found the issue. I needed to pass an additional parameter: evictLocalStoragePods.

    profiles:
    - name: default
      pluginConfig:
      - args:
          evictLocalStoragePods: true
        name: DefaultEvictor

With this change, it started working.

Additionally, increasing the verbosity level can be helpful for debugging:

        cmdOptions:
          v: 5
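
For comparison, folding the same args into the Helm values from the original issue would look roughly like this (a sketch; it assumes nodeFit is also meant to sit under args:, like the other plugins' settings):

deschedulerPolicy:
  profiles:
    - name: default
      pluginConfig:
        - name: DefaultEvictor
          args:
            nodeFit: true
            evictLocalStoragePods: true
        # ...the RemovePodsViolating* plugin entries stay unchanged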


jekawo commented Mar 24, 2025

> I found the issue. I needed to pass an additional parameter: evictLocalStoragePods.
>
>     profiles:
>     - name: default
>       pluginConfig:
>       - args:
>           evictLocalStoragePods: true
>         name: DefaultEvictor
>
> With this change, it started working.
>
> Additionally, increasing the verbosity level can be helpful for debugging:
>
>         cmdOptions:
>           v: 5

I am already using the highest verbosity, and your fix did not work for us.

