Uploaded image for project: 'Red Hat OpenStack Services on OpenShift'
  1. Red Hat OpenStack Services on OpenShift
  2. OSPRH-8747

BMH cannot be provisioned with OCP 4.16

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Critical Critical
    • None
    • None
    • None
    • False
    • Hide

      None

      Show
      None
    • False
    • ?
    • ?
    • ?
    • ?
    • Yes
    • Critical

      When using a deployment based on OCP 4.16 the dataplane deployment stuck provisioning the baremetal hosts:

      [zuul@controller-0 ~]$ oc get bmh -A
      NAMESPACE               NAME                 STATE          CONSUMER             ONLINE   ERROR                AGE
      openshift-machine-api   compute-0            provisioning   openstack-edpm       true                          98m
      openshift-machine-api   compute-1            provisioning   openstack-edpm       true                          98m
      openshift-machine-api   openshift-master-0   provisioned    ocp-fd9vq-master-0   true     registration error   3h1m
      openshift-machine-api   openshift-master-1   provisioned    ocp-fd9vq-master-1   true     registration error   3h1m
      openshift-machine-api   openshift-master-2   provisioned    ocp-fd9vq-master-2   true     registration error   3h1m

      The problem seems to be here:

      [root@10 ~]# podman logs ironic-agent 2>&1 |grep ERROR
      2024-07-17 12:22:01.937 1 DEBUG ironic_python_agent.cmd.agent [-] logging_exception_prefix       = %(asctime)s.%(msecs)03d %(process)d ERROR %(name)s %(instance)s log_opt_values /usr/lib/python3.9/site-packages/oslo_config/cfg.py:2606
      2024-07-17 12:29:16.913 1 ERROR ironic_python_agent.inspector [-] inspector https://172.23.0.3:5050/v1/continue error 400: {"error":{"message":"Node 22dfe333-17f6-494a-84c4-d78b484cc135 is not active, its provision state is clean wait"}}
      2024-07-17 12:29:16.921 1 ERROR ironic_python_agent.agent [-] Failed to perform inspection: stopping inspection, as inspector returned an error: ironic_python_agent.errors.InspectionError: stopping inspection, as inspector returned an error
      
      [zuul@controller-0 ~]$ oc -nopenstack get openstackbaremetalset -oyaml |grep -i clean
          automatedCleaningMode: metadata

      Cluster version:

      [zuul@controller-0 ~]$ oc get clusterversion
      NAME      VERSION   AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.16.0    True        False         150m    Cluster version is 4.16.0

      iDRAC info:

      iDRAC 8
      PowerEdge R730
      BIOS version: 2.16.0
      Firmware version: 2.82.82.82
      

       

            Unassigned Unassigned
            rdiazcam@redhat.com Ricardo Diaz Campos
            rhos-dfg-df
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

              Created:
              Updated:
              Resolved: