OpenShift Bugs / OCPBUGS-35952

Autoscaler fails to balance similar node groups on AWS


    • Moderate
    • Yes
    • False
      * The AWS Cloud Controller Manager adds an AWS-specific zone identifier label to `Node` objects.
      The value of the label can vary between nodes which are otherwise similar in topology, causing the cluster autoscaler to not consider them as similar.
      This release updates the Cluster Autoscaler Operator configuration for AWS to ignore this label so that the nodes are considered as similar.
      (link:https://issues.redhat.com/browse/OCPBUGS-35952[*OCPBUGS-35952*])
    • Release Note Not Required
    • In Progress

      Description of problem:

      The cluster autoscaler fails to balance similar node groups on AWS when running the regression test for https://issues.redhat.com/browse/OCPCLOUD-2616

      Version-Release number of selected component (if applicable):

      4.17.0-0.nightly-2024-06-20-165244

      How reproducible:

      Always 

      Steps to Reproduce:

      1. Create a ClusterAutoscaler with balanceSimilarNodeGroups: true
      2. Create two MachineAutoscalers, each with min/max replicas of 1/8
      3. Add a workload
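
Steps 1 and 2 could be scripted roughly as follows. This is a minimal sketch, not the reporter's actual procedure: it builds the ClusterAutoscaler and MachineAutoscaler manifests as JSON that can be piped to `oc apply -f -`. The MachineSet names are inferred from the node group names in the log lines of this report, and the function names are hypothetical helpers introduced only for illustration.

```python
import json

NAMESPACE = "openshift-machine-api"


def cluster_autoscaler():
    """ClusterAutoscaler with balancing of similar node groups enabled (step 1)."""
    return {
        "apiVersion": "autoscaling.openshift.io/v1",
        "kind": "ClusterAutoscaler",
        "metadata": {"name": "default"},
        "spec": {"balanceSimilarNodeGroups": True},
    }


def machine_autoscaler(machineset_name, min_replicas=1, max_replicas=8):
    """MachineAutoscaler targeting one MachineSet, min/max 1/8 by default (step 2)."""
    return {
        "apiVersion": "autoscaling.openshift.io/v1beta1",
        "kind": "MachineAutoscaler",
        "metadata": {"name": machineset_name, "namespace": NAMESPACE},
        "spec": {
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            "scaleTargetRef": {
                "apiVersion": "machine.openshift.io/v1beta1",
                "kind": "MachineSet",
                "name": machineset_name,
            },
        },
    }


def build_manifests(machineset_names):
    """Bundle all manifests into a single List object."""
    items = [cluster_autoscaler()]
    items += [machine_autoscaler(name) for name in machineset_names]
    return {"apiVersion": "v1", "kind": "List", "items": items}


if __name__ == "__main__":
    # Assumption: MachineSet names inferred from the template node names in the logs.
    names = [
        "zhsun-aws21-5slwv-worker-us-east-2b",
        "zhsun-aws21-5slwv-worker-us-east-2c",
    ]
    print(json.dumps(build_manifests(names), indent=2))
```

Saved under a hypothetical file name such as `gen_manifests.py`, the output could be applied with `python gen_manifests.py | oc apply -f -`. The workload for step 3 is not specified in this report and is left out.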
      

      Actual results:

      The "splitting scale-up" message does not appear in the cluster-autoscaler logs.
      must-gather: https://drive.google.com/file/d/17aZmfQHKZxJEtqPvl37HPXkXA36Yp6i8/view?usp=sharing

      2024-06-21T13:21:08.678016167Z I0621 13:21:08.678006       1 compare_nodegroups.go:157] nodes template-node-for-MachineSet/openshift-machine-api/zhsun-aws21-5slwv-worker-us-east-2b-5109433294514062211 and template-node-for-MachineSet/openshift-machine-api/zhsun-aws21-5slwv-worker-us-east-2c-760092546639056043 are not similar, labels do not match
      2024-06-21T13:21:08.678030474Z I0621 13:21:08.678021       1 orchestrator.go:249] No similar node groups found
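
To check a cluster-autoscaler log for both symptoms at once, something like the following sketch may help. It looks for the "splitting scale-up" phrase (case-insensitively, since only the phrase quoted in this report is assumed) and extracts the node group pairs from "are not similar" lines. The regular expression is modeled on the log line quoted above, and `summarize` is a hypothetical helper name.

```python
import re

# Pattern modeled on the compare_nodegroups.go line quoted in this report.
NOT_SIMILAR = re.compile(
    r"compare_nodegroups\.go:\d+\] nodes (?P<a>\S+) and (?P<b>\S+) "
    r"are not similar, (?P<reason>.+)$"
)


def summarize(log_text):
    """Report whether a scale-up was split and which node groups were rejected."""
    split_seen = False
    not_similar = []
    for line in log_text.splitlines():
        line = line.rstrip()
        if "splitting scale-up" in line.lower():
            split_seen = True
        match = NOT_SIMILAR.search(line)
        if match:
            not_similar.append((match["a"], match["b"], match["reason"]))
    return {"split_seen": split_seen, "not_similar": not_similar}


if __name__ == "__main__":
    import sys

    # Usage: python check_log.py < cluster-autoscaler.log  (file names are hypothetical)
    result = summarize(sys.stdin.read())
    print("splitting scale-up seen:", result["split_seen"])
    for a, b, reason in result["not_similar"]:
        print(f"not similar: {a} vs {b} ({reason})")
```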

      Expected results:

      With balanceSimilarNodeGroups enabled, the autoscaler treats the two node groups as similar and splits the scale-up between them.
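
The release note for this issue attributes the mismatch to an AWS-specific zone identifier label whose value differs between otherwise similar nodes. The sketch below illustrates that idea only; it is not the autoscaler's actual comparison code, and it models nothing beyond the label check. The label key `example.com/zone-id`, the label values, and the function name are hypothetical placeholders, since this report does not name the real key.

```python
# Hypothetical placeholder; the real AWS zone identifier label key is not named in this report.
ZONE_ID_LABEL = "example.com/zone-id"


def labels_match(labels_a, labels_b, ignored_keys=frozenset()):
    """Compare two label maps, skipping any key listed in ignored_keys."""

    def relevant(labels):
        return {k: v for k, v in labels.items() if k not in ignored_keys}

    return relevant(labels_a) == relevant(labels_b)


if __name__ == "__main__":
    # Illustrative values only: same instance type, different zone identifier.
    node_2b = {"node.kubernetes.io/instance-type": "m6i.xlarge", ZONE_ID_LABEL: "zone-a"}
    node_2c = {"node.kubernetes.io/instance-type": "m6i.xlarge", ZONE_ID_LABEL: "zone-b"}

    # Without an ignore list, the differing zone identifier makes the groups dissimilar.
    print(labels_match(node_2b, node_2c))
    # Ignoring that one label lets the otherwise identical groups compare as similar.
    print(labels_match(node_2b, node_2c, ignored_keys={ZONE_ID_LABEL}))
```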

      Additional info:

       

            mimccune@redhat.com Michael McCune
            rhn-support-zhsun Zhaohua Sun
            Zhaohua Sun Zhaohua Sun