  1. OpenShift Bugs
  2. OCPBUGS-42493

ROSA HCP: Unable to upgrade machinepool 4.16->4.17


    • Bug
    • Resolution: Unresolved
    • Major
    • 4.17
    • HyperShift
    • Quality / Stability / Reliability
    • False
    • Critical
    • Rejected

      Please change component if this is not the correct area

      Description of problem:

      I have a 3-node ROSA HCP cluster that I put some load on and then started an upgrade. The upgrade cluster command finished, and I saw the operators and the overall cluster version upgrade properly. When I try to upgrade the machine pools, I see no progress after more than an hour.
      

      Version-Release number of selected component (if applicable):

      4.17.0-rc.5

      How reproducible:

          100%

      Steps to Reproduce:

          1. Create a 3-node multi-AZ ROSA HCP cluster
          2. Load the cluster with kube-burner-ocp (cluster-density-v2 workload with 10 iterations)
          3. Upgrade the cluster:
             rosa upgrade cluster -y -m auto --version 4.17.0-rc.5 -c 2e0u2j2jobn10lok6dsmg7lvs4urts6s --control-plane
          4. Wait for the upgrade to finish, then upgrade the machine pools:
             rosa upgrade machinepool workers-0 -y -c 2e0u2j2jobn10lok6dsmg7lvs4urts6s --version 4.17.0-rc.5
             rosa upgrade machinepool workers-1 -y -c 2e0u2j2jobn10lok6dsmg7lvs4urts6s --version 4.17.0-rc.5
             rosa upgrade machinepool workers-2 -y -c 2e0u2j2jobn10lok6dsmg7lvs4urts6s --version 4.17.0-rc.5
          5. The machine pools never update
      
      

       

      Actual results:

      The machine pools never update. Their scheduled update time passes and no progress appears to be made.

      Expected results:

          The upgrade completes and the machine pools reach the requested version, with all operators and nodes ready.

      Additional info:

       
       
      Cluster details while the machine pools were trying to update:
      
      % rosa list machinepool --cluster sdq-longname-evtvb-xnsexixaycfjnuqyxpjxwmumrxhstbpjtig
      ID         AUTOSCALING  REPLICAS  INSTANCE TYPE  LABELS    TAINTS    AVAILABILITY ZONE  SUBNET                    DISK SIZE  VERSION  AUTOREPAIR  
      workers-0  Yes          1/1-1     m5.xlarge                          us-west-2c         subnet-00e106c25cb9ec639  150 GiB    4.16.13  Yes         
      workers-1  Yes          1/1-1     m5.xlarge                          us-west-2a         subnet-0281f0fe441bbcc14  150 GiB    4.16.13  Yes         
      workers-2  Yes          1/1-1     m5.xlarge                          us-west-2b         subnet-060fece8fe9a2175f  150 GiB    4.16.13  Yes         
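
      The listing above can be checked mechanically while waiting on the upgrade. This is only a rough sketch, not part of the rosa CLI: stale_machinepools is a hypothetical helper, and it assumes VERSION is the second-to-last whitespace-separated column (AUTOREPAIR being last), as in the table shown here.

```shell
# Hypothetical helper: print "<id> <version>" for every machine pool that is
# not yet at the target version. Reads `rosa list machinepool` output on stdin.
# Assumption: VERSION is the second-to-last column and AUTOREPAIR is the last.
stale_machinepools() {
  target="$1"
  awk -v target="$target" '
    $1 == "ID" { next }                        # skip the header row
    NF < 2     { next }                        # skip blank or short lines
    $(NF-1) != target { print $1, $(NF-1) }    # pool is behind the target
  '
}
```

      For example: rosa list machinepool --cluster <cluster> | stale_machinepools 4.17.0-rc.5. Against the listing above this would report all three pools, since each is still at 4.16.13.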
      
      % oc get co
      NAME                                       VERSION       AVAILABLE   PROGRESSING   DEGRADED   SINCE   MESSAGE
      console                                    4.17.0-rc.5   True        False         False      61m
      csi-snapshot-controller                    4.17.0-rc.5   True        False         False      11m
      dns                                        4.17.0-rc.5   True        False         False      61m
      image-registry                             4.17.0-rc.5   True        False         False      61m
      ingress                                    4.17.0-rc.5   True        False         False      61m
      insights                                   4.17.0-rc.5   True        False         False      62m
      kube-apiserver                             4.17.0-rc.5   True        False         False      70m
      kube-controller-manager                    4.17.0-rc.5   True        False         False      70m
      kube-scheduler                             4.17.0-rc.5   True        False         False      70m
      kube-storage-version-migrator              4.17.0-rc.5   True        False         False      62m
      monitoring                                 4.17.0-rc.5   True        False         False      20m
      network                                    4.17.0-rc.5   True        False         False      69m
      node-tuning                                4.17.0-rc.5   True        False         False      11m
      openshift-apiserver                        4.17.0-rc.5   True        False         False      70m
      openshift-controller-manager               4.17.0-rc.5   True        False         False      70m
      openshift-samples                          4.17.0-rc.5   True        False         False      11m
      operator-lifecycle-manager                 4.17.0-rc.5   True        False         False      70m
      operator-lifecycle-manager-catalog         4.17.0-rc.5   True        False         False      70m
      operator-lifecycle-manager-packageserver   4.17.0-rc.5   True        False         False      70m
      service-ca                                 4.17.0-rc.5   True        False         False      62m
      storage                                    4.17.0-rc.5   True        False         False      11m
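
      To confirm that the control plane side has settled before looking at the machine pools, the operator table can be filtered the same way. Again a sketch under assumptions: unsettled_operators is a hypothetical helper, and it expects the standard NAME / VERSION / AVAILABLE / PROGRESSING / DEGRADED column order shown in the oc get co output here.

```shell
# Hypothetical helper: print the name of every cluster operator that is not at
# the target version, or is not Available=True / Progressing=False / Degraded=False.
# Reads `oc get co` output on stdin; assumes the column order
# NAME VERSION AVAILABLE PROGRESSING DEGRADED SINCE MESSAGE.
unsettled_operators() {
  target="$1"
  awk -v target="$target" '
    $1 == "NAME" { next }    # skip the header row
    NF < 5       { next }    # skip blank or short lines
    $2 != target || $3 != "True" || $4 != "False" || $5 != "False" { print $1 }
  '
}
```

      For example: oc get co | unsettled_operators 4.17.0-rc.5. For the operator table in this report it prints nothing, which matches the observation that only the machine pools are stuck.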
      
      % oc get clusterversion
      NAME      VERSION       AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.17.0-rc.5   True        False         5m      Cluster version is 4.17.0-rc.5
      

              Unassigned Unassigned
              prubenda Paige Patton
              Paige Patton
              None
              Zhaohua Sun Zhaohua Sun
              None
              Votes: 0
              Watchers: 4