Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-28244

[ASH] cluster install fails nodes stuck in node.cloudprovider.kubernetes.io/uninitialized

    XMLWordPrintable

Details

    • Critical
    • Yes
    • CLOUD Sprint 249, CLOUD Sprint 250, CLOUD Sprint 251
    • 3
    • Approved
    • False
    • Hide

      None

      Show
      None

    Description

      Description of problem:

      Cluster install fails on ASH, nodes tainted with node.cloudprovider.kubernetes.io/uninitialized=true:NoSchedule

      Version-Release number of selected component (if applicable):

      4.16.0-0.nightly-2024-01-24-133352    

      How reproducible:

      Always    

      Steps to Reproduce:

      1. Built a cluster on ASH  
      $ oc get node            
      NAME                             STATUS     ROLES                  AGE    VERSION
      ropatil-261ash1-x9kcj-master-0   NotReady   control-plane,master   7h     v1.29.1+0e0d15b
      ropatil-261ash1-x9kcj-master-1   NotReady   control-plane,master   7h1m   v1.29.1+0e0d15b
      ropatil-261ash1-x9kcj-master-2   NotReady   control-plane,master   7h1m   v1.29.1+0e0d15b 
      
      $ oc get node -o yaml | grep uninitialized    
            key: node.cloudprovider.kubernetes.io/uninitialized
            key: node.cloudprovider.kubernetes.io/uninitialized
            key: node.cloudprovider.kubernetes.io/uninitialized  
      
      $ oc get po -n openshift-cloud-controller-manager     
      NAME                                              READY   STATUS             RESTARTS         AGE
      azure-cloud-controller-manager-7b75cbbd64-qzhmm   0/1     CrashLoopBackOff   43 (20s ago)     4h54m
      azure-cloud-controller-manager-7b75cbbd64-w5cl8   1/1     Running            70 (2m52s ago)   7h33m
      azure-cloud-node-manager-9r8gb                    0/1     CrashLoopBackOff   93 (79s ago)     7h33m
      azure-cloud-node-manager-jn8lv                    0/1     CrashLoopBackOff   93 (82s ago)     7h33m
      azure-cloud-node-manager-n4vt4                    0/1     CrashLoopBackOff   93 (102s ago)    7h33m  
      
      $ oc -n openshift-cloud-controller-manager logs -f azure-cloud-controller-manager-7b75cbbd64-w5cl8 -c cloud-controller-manager     Error from server: no preferred addresses found; known addresses: []     

      Actual results:

      Cluster install failed on ASH 

      Expected results:

      Cluster install succeed on ASH    

      Additional info:

      log-bundle: https://drive.google.com/file/d/1QQwyQ1MxuunZx6AXqOTt6KwYwUk2GW7R/view?usp=sharing 

      Attachments

        Activity

          People

            rh-ee-tbarberb Theo Barber-Bany
            rhn-support-zhsun Zhaohua Sun
            Zhaohua Sun Zhaohua Sun
            Votes:
            0 Vote for this issue
            Watchers:
            12 Start watching this issue

            Dates

              Created:
              Updated: