Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-60876

Day2 firmware update on Dell failed due to idrac reboot error

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • 3
    • Critical
    • None
    • None
    • None
    • Rejected
    • Metal Platform 276, Metal Platform 277, Metal Platform 278, Metal Platform 279
    • 4
    • Proposed
    • Known Issue
    • Hide
      An insufficient user delay setting in the BIOS can cause the iDRAC firmware update to fail. This results in an iDRAC reboot error and service interruption. To resolve this problem, disable SR-IOV and increase the AC Power Recovery User Delay. Reboot the system and attempt the update again.
      Show
      An insufficient user delay setting in the BIOS can cause the iDRAC firmware update to fail. This results in an iDRAC reboot error and service interruption. To resolve this problem, disable SR-IOV and increase the AC Power Recovery User Delay. Reboot the system and attempt the update again.
    • None
    • None
    • None
    • None

      Description of problem:

      
      Running a day2 update on my Dell xr8620t failed with the following error:
      
        errorMessage: 'Firmware update failed for node fd8e8d2c-8042-475e-b276-c19ff7453013,
          firmware http://hv14.telco5gran.eng.rdu2.redhat.com:8888/firmware/xr8620t/iDRAC-with-Lifecycle-Controller_Firmware_WN31M_WN64_7.20.30.50_A00.EXE.
          Error: Unable to complete the job because of an error with iDRAC reboot during
          iDRAC firmware update.'
        errorType: servicing error
      
      The idrac shows the following lifecycle logs:
      
      2025-08-26 07:14:26	JCP042	Job JID_562100372172 failed because Unable to complete the job because of an error with iDRAC reboot during iDRAC firmware update.	
      Log Sequence Number:1319704
      Detailed Description:The (installation or configuration) job identified in the message cannot be run. For more information, see the Job Queue.
      Recommended Action:Review the LC Log for related update, configuration, or collection job errors. Retry the operation. If the problem persists, contact your service provider.
      
      2025-08-26 07:14:26	RED108	Unable to complete the job because of an error with iDRAC reboot during iDRAC firmware update.	
      Log Sequence Number:1319703
      Detailed Description:The operation cannot be completed because of an error with iDRAC reboot during iDRAC firmware update.
      Recommended Action:Retry the operation. If the issue persists, contact your service provider.
      
      
      Starting point:
      
      $ oc get hostfirmwarecomponents.metal3.io -n xr8620t-pool xr8620txdg22 -o json | jq '.status.components'
      [
        {
          "component": "bios",
          "currentVersion": "2.4.4",
          "initialVersion": "2.6.3",
          "lastVersionFlashed": "2.4.4",
          "updatedAt": "2025-08-23T00:13:51Z"
        },
        {
          "component": "bmc",
          "currentVersion": "7.10.90.00",
          "initialVersion": "7.20.30.50",
          "lastVersionFlashed": "7.10.90.00",
          "updatedAt": "2025-08-23T00:13:51Z"
        },
        {
          "component": "nic:NIC.Slot.2",
          "currentVersion": "23.0.8",
          "initialVersion": "23.0.8"
        },
        {
          "component": "nic:NIC.Slot.3",
          "currentVersion": "23.0.8",
          "initialVersion": "23.0.8"
        }
      ]
      $ oc get hostfirmwaresettings.metal3.io -n xr8620t-pool xr8620txdg22 -o json | jq '.status.settings
                | {
                    ("SriovGlobalEnable"):(.SriovGlobalEnable),
                    ("AcPwrRcvryUserDelay"):(.AcPwrRcvryUserDelay)
                  }'
      {
        "SriovGlobalEnable": "Enabled",
        "AcPwrRcvryUserDelay": "150"
      }
      
      
      Update was to set the following:
      
        bios:
          attributes:
            AcPwrRcvryUserDelay: 180
      
        biosFirmware:
          url: http://hv14.telco5gran.eng.rdu2.redhat.com:8888/firmware/xr8620t/BIOS_XK9CT_WN64_2.6.3_05.EXE
          version: 2.6.3
        bmcFirmware:
          url: http://hv14.telco5gran.eng.rdu2.redhat.com:8888/firmware/xr8620t/iDRAC-with-Lifecycle-Controller_Firmware_WN31M_WN64_7.20.30.50_A00.EXE
          version: 7.20.30.50
      
      
      The BIOS setting was updated, and the BIOS firmware was successfully upgraded.
      
          

      Version-Release number of selected component (if applicable):

      OCP: 4.20.0-0.nightly-2025-08-12-153542
      
          

      How reproducible:

      Intermittent. I was able to successfully update another xr8620t at the same time, with similar change set.
      
          

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

      
          

      Expected results:

      
          

      Additional info:

      
          

              janders@redhat.com Jacob Anders
              dpenney1@redhat.com Don Penney
              None
              None
              Jad Haj Yahya Jad Haj Yahya
              Katie Drake Katie Drake
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated: