OpenShift Hosted Control Plane / HOSTEDCP-1427

Z Compute nodes not getting detached after scaling nodepool to 0


    • Type: Bug
    • Resolution: Done
    • Priority: Major

      Description of problem:

      Compute nodes take too long (more than an hour) to detach from the cluster after the nodepool is scaled to 0.

      Version-Release number of selected component (if applicable):

      MCE 2.4.3

      MCE 2.5 (all builds)

      How reproducible:

      Steps to Reproduce:

      1. Install MCE on a management cluster.
      2. Create the AgentServiceConfig, HCP, and InfraEnv.
      3. Attach compute nodes (s390x).
      4. Scale the nodepool down to 0 (deletion procedure).
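
Step 4 and the delay it triggers could be reproduced and timed roughly as follows. This is a minimal sketch, not the reporter's exact procedure: the function names, the NodePool name, its namespace, and the hosted-cluster kubeconfig path are all hypothetical placeholders, since the report does not give them. It assumes `oc` is logged in to the management cluster for the scale command.

```shell
# Sketch only: all names passed to these functions are placeholders.

# Scale a NodePool to 0 replicas on the management cluster.
# $1 = NodePool name (hypothetical), $2 = namespace holding the NodePool (hypothetical)
scale_nodepool_to_zero() {
  oc scale nodepool "$1" -n "$2" --replicas=0
}

# Poll the hosted cluster until it reports no nodes, then print the elapsed seconds.
# $1 = path to the hosted cluster kubeconfig (hypothetical), $2 = poll interval in seconds
wait_for_no_nodes() {
  kubeconfig="$1"
  interval="${2:-30}"
  start=$(date +%s)
  # `oc get nodes --no-headers` prints one line per node and nothing on stdout when there are none.
  while [ "$(oc get nodes --no-headers --kubeconfig "$kubeconfig" 2>/dev/null | wc -l | tr -d ' ')" -gt 0 ]; do
    sleep "$interval"
  done
  echo "$(( $(date +%s) - start ))"
}

# Example (placeholder values):
#   scale_nodepool_to_zero my-nodepool clusters
#   wait_for_no_nodes ./hosted-kubeconfig 30
```

With the behaviour described in this report, the second function would be expected to print a value well above 3600.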

      Actual results:

      Nodes are detached only after a long delay (more than an hour).

      Expected results:

      Nodes should be detached from the HCP promptly after the nodepool is scaled to 0.

      Additional info:

      Findings from debugging:

      After scaling the nodepool in from 2 to 0, two pods are created in the assisted-installer namespace to reclaim the nodes. Both go into ImagePullBackOff because they cannot pull the image from registry.redhat.io.

      # oc get po -A
      NAMESPACE                                          NAME                                                      READY   STATUS             RESTARTS      AGE
      assisted-installer                                 worker-0-agent-hcp-ci-com-reclaim-t4qhq                   0/1     ImagePullBackOff   0             2m58s
      assisted-installer                                 worker-1-agent-hcp-ci-com-reclaim-d68c6                   0/1     ImagePullBackOff   0             2m38s 
      Events:
        Type     Reason          Age                   From               Message
        ----     ------          ----                  ----               -------
        Normal   Scheduled       4m8s                  default-scheduler  Successfully assigned assisted-installer/worker-0-agent-hcp-ci-com-reclaim-t4qhq to worker-0.agent-hcp.ci.com
        Normal   AddedInterface  4m9s                  multus             Add eth0 [10.132.0.25/23] from ovn-kubernetes
        Warning  Failed          2m46s (x6 over 4m8s)  kubelet            Error: ImagePullBackOff
        Normal   Pulling         2m33s (x4 over 4m9s)  kubelet            Pulling image "registry.redhat.io/multicluster-engine/assisted-installer-agent-rhel9@sha256:5172591a3e9fa2c39dcc410eb28bfc77e33932687662ba484d94a3a3436177c2"
        Warning  Failed          2m32s (x4 over 4m8s)  kubelet            Failed to pull image "registry.redhat.io/multicluster-engine/assisted-installer-agent-rhel9@sha256:5172591a3e9fa2c39dcc410eb28bfc77e33932687662ba484d94a3a3436177c2": reading manifest sha256:5172591a3e9fa2c39dcc410eb28bfc77e33932687662ba484d94a3a3436177c2 in registry.redhat.io/multicluster-engine/assisted-installer-agent-rhel9: unauthorized: access to the requested resource is not authorized
        Warning  Failed          2m32s (x4 over 4m8s)  kubelet            Error: ErrImagePull
        Normal   BackOff         2m19s (x7 over 4m8s)  kubelet            Back-off pulling image "registry.redhat.io/multicluster-engine/assisted-installer-agent-rhel9@sha256:5172591a3e9fa2c39dcc410eb28bfc77e33932687662ba484d94a3a3436177c2"
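
To spot reclaim pods in this state without scanning the full listing, the `oc get po -A` output can be filtered on its STATUS column. The helper below is a small sketch; the function name `stuck_image_pulls` is made up for illustration, and it assumes the default column layout shown above (NAMESPACE, NAME, READY, STATUS, ...).

```shell
# Print namespace/name for every pod whose STATUS is ImagePullBackOff or ErrImagePull.
# Reads `oc get po -A` style output on stdin and skips the header row.
stuck_image_pulls() {
  awk '$1 != "NAMESPACE" && ($4 == "ImagePullBackOff" || $4 == "ErrImagePull") { print $1 "/" $2 }'
}

# Example, run against the same cluster the listing above was taken from:
#   oc get po -A | stuck_image_pulls
```

Each pod it prints can then be inspected with `oc describe pod <name> -n <namespace>` to read the pull error in its events.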
      

              cchun@redhat.com Crystal Chun
              rh-ee-dveerabh Damisetti Veerabhadra
              David Huynh David Huynh