OpenShift Bugs · OCPBUGS-57041

ClusterAutoscaler sometimes creates 2 cluster-autoscaler-default pods


    • Type: Bug
    • Resolution: Unresolved
    • Priority: Normal
    • Fix Version: 4.20.0
    • Affects Versions: 4.17.z, 4.18.z, 4.19.z, 4.20
    • Component: Cluster Autoscaler
    • Quality / Stability / Reliability
    • Sprint: AUTOSCALE - Sprint 272
    • Release Note Status: Proposed
    • Release Note Type: Bug Fix
      *Cause*: Creating a ClusterAutoscaler object in a cluster with the Cluster Autoscaler Operator enabled.
      *Consequence*: When the ClusterAutoscaler is created, two cluster-autoscaler-default pods are sometimes created at the same time in the openshift-machine-api namespace, and one is immediately killed.
      *Fix*: Internal fixes; no further explanation needed.
      *Result*: Only one pod is created now.

      Description of problem:

          When you first create a ClusterAutoscaler object, sometimes 2 cluster-autoscaler-default pods start up, and one is killed automatically.

      Version-Release number of selected component (if applicable):

          4.20

      How reproducible:

          Only the first time the Cluster Autoscaler Operator creates a ClusterAutoscaler object, or when the operator has never observed an existing ClusterAutoscaler object.

      Steps to Reproduce:

        1. Install a non-HCP (hosted control plane) cluster with CAS enabled.
          2. Create a ClusterAutoscaler object like this:
      
      apiVersion: "autoscaling.openshift.io/v1"
      kind: "ClusterAutoscaler"
      metadata:
        name: "default"
      spec:
        logVerbosity: 6
        balanceSimilarNodeGroups: true
        ignoreDaemonsetsUtilization: false
        skipNodesWithLocalStorage: true
        podPriorityThreshold: -10
        resourceLimits:
          maxNodesTotal: 24
          cores:
            min: 8
            max: 128
          memory:
            min: 4
            max: 256
        scaleDown:
          enabled: true
          # How long after scale up that scale down evaluation resumes - if omitted defaults to 10 minutes
          delayAfterAdd: 1m
          # How long after node deletion that scale down evaluation resumes - if omitted defaults to 0 seconds
          delayAfterDelete: 1m
          # How long after scale down failure that scale down evaluation resumes - if omitted defaults to 3 minutes
          delayAfterFailure: 1m
          # How long a node should be unneeded before it is eligible for scale down - if omitted defaults to 10 minutes
          unneededTime: 1m
          # Node utilization level, defined as sum of requested resources divided by capacity, below which a node can be considered for scale down - if omitted defaults to 0.5
          utilizationThreshold: "0.4"     
      
      3. Watch all pods in the openshift-machine-api namespace, and observe that 2 pods are created, but 1 is immediately killed.
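
      For step 3, one way to spot the duplicate from a shell is to filter the `oc get pods -n openshift-machine-api` output for autoscaler pods and count them. The helper name, pod names, and statuses below are illustrative samples, not captured from a real cluster:

      ```shell
      # Hypothetical helper: counts cluster-autoscaler-default pods from
      # `oc get pods -n openshift-machine-api --no-headers` output on stdin.
      # More than one matching pod indicates the duplicate-pod symptom.
      count_cas_pods() {
        grep -c '^cluster-autoscaler-default-'
      }

      # Illustrative sample of what the bug looks like (names are made up):
      sample='cluster-autoscaler-default-6f9c7b-abcde   1/1   Running       0   5s
      cluster-autoscaler-default-6f9c7b-fghij   1/1   Terminating   0   5s
      machine-api-operator-7d5f9c-xk2lp         2/2   Running       0   2h'

      printf '%s\n' "$sample" | count_cas_pods   # prints 2 when the bug reproduces
      ```

      On a live cluster the equivalent check would be `oc get pods -n openshift-machine-api --no-headers | count_cas_pods`, assuming `oc` is logged in to the affected cluster.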
          

      Actual results:

          Two cluster-autoscaler-default pods are started, but one is killed immediately.

      Expected results:

          Only one cluster-autoscaler-default pod should be created.

      Additional info:
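
          The "sometimes, on first creation" behavior described above is consistent with a create race against a cache that has not yet observed the object. Purely as an illustration (hypothetical names throughout; this is NOT the operator's actual code), a minimal sketch of that pattern:

      ```python
      # Illustrative sketch: two reconcile passes both consult a stale cache
      # that has not yet observed the ClusterAutoscaler's workload, so each
      # pass creates a pod; a later sync deletes the extra one -- matching
      # the "two pods, one immediately killed" symptom.

      class FakeCluster:
          def __init__(self):
              self.pods = []

          def create_pod(self, prefix):
              self.pods.append(f"{prefix}-{len(self.pods)}")

          def ensure_single(self, prefix):
              # Later sync pass: keep one matching pod, kill the extras.
              extras = [p for p in self.pods if p.startswith(prefix)][1:]
              for p in extras:
                  self.pods.remove(p)

      def reconcile(cluster, cache):
          # Stale cache: the operator has never observed the object, so it creates.
          if not cache:
              cluster.create_pod("cluster-autoscaler-default")

      cluster = FakeCluster()
      stale_cache = set()              # neither pass has observed the object yet
      reconcile(cluster, stale_cache)  # first pass creates a pod
      reconcile(cluster, stale_cache)  # racing pass creates a duplicate
      assert len(cluster.pods) == 2    # the symptom: two pods at once

      cluster.ensure_single("cluster-autoscaler-default")
      assert len(cluster.pods) == 1    # one pod is immediately killed
      print(cluster.pods)
      ```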

          

              Assignee: Max Cao (rh-ee-macao)
              Reporter: Max Cao (rh-ee-macao)
              QA Contact: Paul Rozehnal
              Votes: 0
              Watchers: 4