Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-36675

Cannot deploy FIPS enabled OCP cluster

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • rhel-8.6.0.z
    • python3
    • None
    • None
    • None
    • rhel-sst-pt-python-ruby-nodejs
    • ssg_core_services
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None

          When enabling FIPS OCP deployment fails

       
      OCP installation failing with FIPS mode (both using IPI and assisted installer) v4.15
      for IPI job it fails with bellow error:
       
      lapsed]", "level=debug msg=ironic_node_v1.openshift-master-host[2]: Still creating... [59m40s elapsed]", "level=debug msg=ironic_node_v1.openshift-master-host[0]: Still creating... [59m40s elapsed]", "level=debug msg=ironic_node_v1.openshift-master-host[1]: Still creating... [59m50s elapsed]", "level=debug msg=ironic_node_v1.openshift-master-host[2]: Still creating... [59m50s elapsed]", "level=debug msg=ironic_node_v1.openshift-master-host[0]: Still creating... [59m50s elapsed]", "level=error", "level=error msg=Error: could not contact Ironic API: timeout reached", "level=error", "level=error msg= with ironic_node_v1.openshift-master-host[1],", "level=error msg= on main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":", "level=error msg= 13: resource \"ironic_node_v1\" \"openshift-master-host\" {", "level=error", "level=error", "level=error msg=Error: could not contact Ironic API: context deadline exceeded", "level=error", "level=error msg= with ironic_node_v1.openshift-master-host[2],", "level=error msg= on main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":", "level=error msg= 13: resource \"ironic_node_v1\" \"openshift-master-host\" {", "level=error", "level=error", "level=error msg=Error: could not contact Ironic API: timeout reached", "level=error", "level=error msg= with ironic_node_v1.openshift-master-host[0],", "level=error msg= on main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":", "level=error msg= 13: resource \"ironic_node_v1\" \"openshift-master-host\"

      {", "level=error", "level=error msg=failed to fetch Cluster: failed to generate asset \"Cluster\": failed to create cluster: failure applying terraform for \"masters\" stage: error applying Terraform configs: failed to apply Terraform: error(BaremetalIronicAPITimeout) from Infrastructure Provider: Unable to the reach provisioning service. This failure can be caused by incorrect network/proxy settings, inability to download the machine operating system images, or other misconfiguration. Please check access to the bootstrap host, and for any failing services."], "stdout": "", "stdout_lines": []}

      For more info - link to must-gather:
      https://s3.upshift.redhat.com/DH-PROD-OCP-EDGE-QE-CI/Infra/must-gather/4504/index.html
      on spoke cluster it fails on finalizing stage
      2 out of 4 containers of metal3 are  failing
       
      Warning BackOff 59m kubelet Back-off restarting failed container metal3-ironic-inspector in pod metal3-6cbff67b6c-6gblc_openshift-machine-api(e88416d0-d736-461e-9bd7-bb734c3283e4) Warning BackOff 66s (x289 over 59m) kubelet Back-off restarting failed container metal3-ironic in pod metal3-6cbff67b6c-6gblc_openshift-machine-api(e88416d0-d736-461e-9bd7-bb734c3283e4)
      with:
       
      + CONFIG_OPTIONS+=' --config-file /auth/ironic/auth-config' + ironic-inspector-dbsync --config-file /etc/ironic-inspector/ironic-inspector.conf upgrade Traceback (most recent call last): File "/usr/bin/ironic-inspector-dbsync", line 6, in <module> from ironic_inspector.cmd.dbsync import main File "<frozen importlib._bootstrap>", line 1007, in _find_and_load File "<frozen importlib._bootstrap>", line 986, in _find_and_load_unlocked File "<frozen importlib._bootstrap>", line 680, in _load_unlocked File "<frozen importlib._bootstrap_external>", line 846, in exec_module File "<frozen importlib._bootstrap_external>", line 989, in get_code TypeError: source_hash() missing required argument 'source' (pos 2)
      Version-Release number of selected component (if applicable):

          4.15

      How reproducible:

          Always

      Steps to Reproduce:

      Deploy OCP v4.15 with FIPS
          

      Actual results:

          Installation fails on timeout 

      Expected results:

          Installation should finish with success

      Additional info:

          For more info - link to must-gather:
      https://s3.upshift.redhat.com/DH-PROD-OCP-EDGE-QE-CI/Infra/must-gather/4504/index.html

              python-maint python-maint
              vkolodny@redhat.com Vladislav Kolodny
              python-maint python-maint
              bot rhel-cs-apps-subsystem-qe bot rhel-cs-apps-subsystem-qe
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated:
                Resolved: