OpenShift Bugs / OCPBUGS-43491

CAPI machine takes more than 30 minutes to reach Running on Azure

    • Moderate
    • None
    • Rejected
    • False

      Description of problem:

      A machine takes about 30 minutes to reach the Running phase, and sometimes CSRs are still pending after the machine is Running.

      Version-Release number of selected component (if applicable):

      4.18.0-0.nightly-2024-10-16-094159    

      How reproducible:

       Always   

      Steps to Reproduce:

      1. Create a MachineSet with CAPI on Azure:
      ---
      apiVersion: cluster.x-k8s.io/v1beta1
      kind: Cluster
      metadata:
        name: zhsun-az17-jqb9n
        namespace: openshift-cluster-api
      spec:
        infrastructureRef:
          apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
          kind: AzureCluster
          name: zhsun-az17-jqb9n
          namespace: openshift-cluster-api
      ---
      apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
      kind: AzureMachineTemplate
      metadata:
        name: capi-azure-machine-template1
        namespace: openshift-cluster-api
      spec:
        template:
          spec:
            #allocatePublicIP: true
            #capacityReservationGroupID: /subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/ZHSUN-AZ18-RNSMV-RG/providers/Microsoft.Compute/capacityReservationGroups/zhsun-capacity
            #additionalCapabilities:
            #  ultraSSDEnabled: true
            #publicIP: true
            networkInterfaces:
            - privateIPConfigs: 1
              subnetName: zhsun-az17-jqb9n-worker-subnet
              acceleratedNetworking: true
            identity: None
            #failureDomain: "1"
            image:
              id: /subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/zhsun-az17-jqb9n-rg/providers/Microsoft.Compute/galleries/gallery_zhsun_az17_jqb9n/images/zhsun-az17-jqb9n-gen2/versions/latest
            osDisk:
              diskSizeGB: 128
              managedDisk:
                storageAccountType: Standard_LRS
              osType: Linux
            vmSize: Standard_D4s_v3
            userDataSecret:
              name: worker-user-data
      ---
      apiVersion: cluster.x-k8s.io/v1beta1
      kind: MachineSet
      metadata:
        name: capi-ms1
        namespace: openshift-cluster-api
      spec:
        clusterName: zhsun-az17-jqb9n
        replicas: 1
        selector:
          matchLabels:
            machine.openshift.io/cluster-api-cluster: zhsun-az17-jqb9n
        template:
          metadata:
            labels:
              machine.openshift.io/cluster-api-cluster: zhsun-az17-jqb9n
          spec:
            ultraSSDCapability: "Enabled"
            bootstrap:
              dataSecretName: worker-user-data
            clusterName: zhsun-az17-jqb9n
            infrastructureRef:
              apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
              kind: AzureMachineTemplate
              name: capi-azure-machine-template1

      Actual results:

      $ oc get machine -w
      NAME             CLUSTER            NODENAME   PROVIDERID   PHASE     AGE   VERSION
      capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   16s
      capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   28s
      capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   2m4s
      capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   27m
      capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   27m
      capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   27m
      capi-ms1-rhtks   zhsun-az17-jqb9n              azure:///subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/zhsun-az17-jqb9n-rg/providers/Microsoft.Compute/virtualMachines/capi-ms1-rhtks   Pending   27m
      capi-ms1-rhtks   zhsun-az17-jqb9n              azure:///subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/zhsun-az17-jqb9n-rg/providers/Microsoft.Compute/virtualMachines/capi-ms1-rhtks   Provisioned   27m
      capi-ms1-rhtks   zhsun-az17-jqb9n              azure:///subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/zhsun-az17-jqb9n-rg/providers/Microsoft.Compute/virtualMachines/capi-ms1-rhtks   Provisioned   27m
      capi-ms1-rhtks   zhsun-az17-jqb9n              azure:///subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/zhsun-az17-jqb9n-rg/providers/Microsoft.Compute/virtualMachines/capi-ms1-rhtks   Provisioned   37m
      capi-ms1-rhtks   zhsun-az17-jqb9n   capi-ms1-rhtks   azure:///subscriptions/53b8f551-f0fc-4bea-8cba-6d1fefd54c8a/resourceGroups/zhsun-az17-jqb9n-rg/providers/Microsoft.Compute/virtualMachines/capi-ms1-rhtks   Running       37m    
      
      must-gather: https://drive.google.com/file/d/1yAg__s-3j1o4n4VNIGk1BPUOUS_DDPjh/view?usp=sharing
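
To make the delay easier to quantify, the watch output above could be reduced to the age at which each phase first appears. This is only a rough sketch, not part of the original report: the helper names `parse_age` and `first_seen` are hypothetical, the provider ID in the sample is abbreviated, and the parsing assumes the VERSION column is empty (as it is in the output above), so that the last two fields of each row are PHASE and AGE.

```python
import re

# Machine phases reported in the PHASE column of `oc get machine`.
PHASES = {"Pending", "Provisioning", "Provisioned", "Running",
          "Deleting", "Deleted", "Failed", "Unknown"}

UNIT_SECONDS = {"d": 86400, "h": 3600, "m": 60, "s": 1}


def parse_age(age: str) -> int:
    """Convert an age such as '16s', '2m4s' or '27m' to seconds."""
    parts = re.findall(r"(\d+)([dhms])", age)
    if not parts or "".join(n + u for n, u in parts) != age:
        raise ValueError(f"unrecognised age: {age!r}")
    return sum(int(n) * UNIT_SECONDS[u] for n, u in parts)


def first_seen(watch_output: str) -> dict[str, int]:
    """Map each phase to the age (in seconds) of the first row showing it.

    Assumes VERSION is empty, so the last two fields are PHASE and AGE.
    NODENAME and PROVIDERID may be empty, so rows are read from the right.
    """
    seen: dict[str, int] = {}
    for line in watch_output.splitlines():
        tokens = line.split()
        # Skips the header row and anything that is not a machine row.
        if len(tokens) < 4 or tokens[-2] not in PHASES:
            continue
        seen.setdefault(tokens[-2], parse_age(tokens[-1]))
    return seen


# Rows taken from the output above; the provider ID is abbreviated here.
SAMPLE = """\
NAME             CLUSTER            NODENAME   PROVIDERID   PHASE     AGE   VERSION
capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   16s
capi-ms1-rhtks   zhsun-az17-jqb9n                           Pending   27m
capi-ms1-rhtks   zhsun-az17-jqb9n              azure:///<abbreviated>   Provisioned   27m
capi-ms1-rhtks   zhsun-az17-jqb9n   capi-ms1-rhtks   azure:///<abbreviated>   Running   37m
"""

print(first_seen(SAMPLE))
```

For the rows above this reports Provisioned first appearing at 27m and Running at 37m, so roughly 27 minutes pass before the provider ID is set and about 10 more before the node joins.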
      
      On another cluster, CSRs were still pending after the machine reached Running:
      $ oc get csr                      
      NAME        AGE     SIGNERNAME                                    REQUESTOR                                                                   REQUESTEDDURATION   CONDITION
                                                  <none>              Approved,Issued
      csr-d4bnd   5h10m   kubernetes.io/kube-apiserver-client-kubelet   system:serviceaccount:openshift-machine-config-operator:node-bootstrapper   <none>              Pending
      csr-d7x29   5h10m   kubernetes.io/kube-apiserver-client-kubelet   system:serviceaccount:openshift-machine-config-operator:node-bootstrapper   <none>              Pending
      
      machine-approver log: https://drive.google.com/file/d/1eikypy6Etej0SgSuGwvwiRt-UcHezm2t/view?usp=sharing 
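
When checking other clusters for the same symptom, the pending requests can be picked out of the `oc get csr` output with a small filter. This is a minimal sketch under the assumption that CONDITION is the last column and NAME the first, as in the output above; `pending_csrs` is a hypothetical helper name, not an existing tool.

```python
def pending_csrs(csr_output: str) -> list[str]:
    """Return the names of CSRs whose CONDITION column is exactly 'Pending'."""
    names = []
    for line in csr_output.splitlines():
        tokens = line.split()
        # The header row ends in 'CONDITION' and approved rows end in
        # 'Approved,Issued', so neither matches.
        if len(tokens) >= 2 and tokens[-1] == "Pending":
            names.append(tokens[0])
    return names


# Header and the two pending rows copied from the output above.
SAMPLE_CSR = """\
NAME        AGE     SIGNERNAME                                    REQUESTOR                                                                   REQUESTEDDURATION   CONDITION
csr-d4bnd   5h10m   kubernetes.io/kube-apiserver-client-kubelet   system:serviceaccount:openshift-machine-config-operator:node-bootstrapper   <none>              Pending
csr-d7x29   5h10m   kubernetes.io/kube-apiserver-client-kubelet   system:serviceaccount:openshift-machine-config-operator:node-bootstrapper   <none>              Pending
"""

print(pending_csrs(SAMPLE_CSR))
```

The returned names can then be inspected individually with `oc describe csr <name>` and compared against the machine-approver log.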

      Expected results:

      The machine joins the cluster quickly.

      Additional info:

          

              ddonati@redhat.com Damiano Donati
              rhn-support-zhsun Zhaohua Sun
              Zhaohua Sun Zhaohua Sun