Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-56048

OpenShift IPI virtulmedia dual stack failed on HPE arm servers

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • 3
    • Critical
    • None
    • None
    • None
    • Rejected
    • Metal Platform 271, Metal Platform 272, Metal Platform 273
    • 3
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

        bash-5.1$ oc get bmh -A
      NAMESPACE               NAME        STATE          CONSUMER                              ONLINE   ERROR   AGE
      openshift-machine-api   master-00   provisioned    ci-op-1391w5wr-wcfsw-master-0         true             170m
      openshift-machine-api   master-01   provisioned    ci-op-1391w5wr-wcfsw-master-1         true             170m
      openshift-machine-api   master-02   provisioned    ci-op-1391w5wr-wcfsw-master-2         true             170m
      openshift-machine-api   worker-00   provisioned    ci-op-1391w5wr-wcfsw-worker-0-p9zn2   true             170m
      openshift-machine-api   worker-01   provisioning   ci-op-1391w5wr-wcfsw-worker-0-pgztm   true             170m
      
      Only 2 node are available:
      master-00.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com   Ready      control-plane,master   137m   v1.32.4
      master-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com   Ready      control-plane,master   135m   v1.32.4
      worker-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com   NotReady   worker                 17m    v1.32.4
      
      
      Artifacts of failed job:
      https://gcsweb-qe-private-deck-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/qe-private-deck/pr-logs/pull/openshift_release/63939/rehearse-63939-periodic-ci-openshift-openshift-tests-private-release-4.19-multi-nightly-baremetal-ipi-ovn-dualstack-arm-vmedia-f7-test/1921896364287987712/artifacts/
      
      Note: job took 4 hours since there was a wait step 
      Same job on AMD is working fine 

      Version-Release number of selected component (if applicable):

          

      How reproducible:

          Always

      Steps to Reproduce:

          1. Deploy periodic-ci-openshift-openshift-tests-private-release-4.19-multi-nightly-baremetal-ipi-ovn-dualstack-arm-vmedia-f7     
          2.
          3.
          

      Actual results:

       Failed

      Expected results:

      Should pass    
      
      

      Additional info:

      arm servers in our lab are ProLiant RL300 Gen11 with ILO6

      Relevant logs from kubelet journal:
      May 12 13:51:37 worker-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com kubenswrapper[5578]: I0512 13:51:37.282466    5578 kubelet_node_status.go:78] "Attempting to register node" node="worker-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com"
      May 12 13:51:37 worker-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com kubenswrapper[5578]: E0512 13:51:37.300304    5578 kubelet_node_status.go:116] "Unable to register node with API server, error getting existing node" err="nodes \"worker-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com\" is forbidden: User \"system:anonymous\" cannot get resource \"nodes\" in API group \"\" at the cluster scope" node="worker-01.ci-op-1391w5wr.ocpqe.arm.eng.rdu2.redhat.com"
      5:07
      No valid client certificate is found but the server is not responsive. A restart may be necessary to retrieve new initial credentials." lastCertificateAvailabilityTime="2025-05-12 13:42:16.129168335 +0000 UTC m=+0.090883429" shutdownThreshold="5m0s"
       
       

              rhn-engineering-hpokorny Honza Pokorny
              jadha Jad Haj Yahya
              None
              None
              Steeve Goveas Steeve Goveas
              None
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: