Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-7799

[OCP 4.10] Baremetal IPI worker scaleup failed, "snponly.efi" file missing

    XMLWordPrintable

Details

    • Moderate
    • No
    • 5
    • False
    • Hide

      None

      Show
      None

    Description

      Description of problem:

      A new node was being scale up in a working 4.10 baremetal IPI environment but it failed:
      - The node doesn't manage to boot using PXE boot because the TFTP server doesn't have the file "snponly.efi"
      - Checking the "metal3-dnsmasq" container in the "/shared/tftpboot" that file doesn't exist and it is confirmed by the tftp logs of the same container:~~~
      sh-4.4# ls /shared/tftpboot
      0c818e3b-5bf1-5f35-a2d3-2a8ccb960ee9.converted    272442a2-0552-5048-996a-47c38a0a0ff2.converted    a4f83807-0b44-5668-abd6-e7aaa2cdf9c7.converted    ee28374a-26bc-5740-94e7-2fc4b15a4b76.converted
      13f119af-710b-566b-ad6b-b2f7e2db3707.converted    53c02722-4914-5722-8a72-411d27c60307.converted    a6fba09d-5734-5169-8a50-1fcb4a9bfe69.converted    fe56fdb6-a571-5e9a-b311-ecb02d0f64c2.converted
      190c87f1-c6a9-56dc-9a73-ab9aa0fc40b9.converted    7f1b22ee-81cc-59f9-8109-7280bd31ec83.converted    aa5a365a-7e6d-5666-b796-63173b990977.converted
      1a6acf62-c6ac-5e0f-85f0-0be804b60a3e.converted    94a071a3-de9a-5fe8-88a0-7a2114288e9a.converted    b5410760-a673-5e76-810b-6df0c9dc6359.converted
      237064bf-c040-59e5-955f-90644d4bbde5.converted    9ddd6dd9-e074-538f-9d34-0bace0361b1f.converted    e62d2dc9-646a-58d8-83e3-b2b8ec926453.converted
      sh-4.4# exit
      ~~~
      Logs from dnsmaq-dhcp says:
      ~~~
      dnsmasq-tftp: file /shared/tftpboot/snponly.efi not found
      ~~~Screenshot of a few logs taken during the process is attachedDeleting the pod and letting it recreate solves the issue
      

      Version-Release number of selected component (if applicable):

      Openshift 4.10

      How reproducible:

      not able to reproduce

      Steps to Reproduce:

      1.
      2.
      3.
      

      Actual results:

      The TFTP ipxe files were mysteriously missing so that makes the deployment failed.
      

      Expected results:

      No missing files

      Additional info:

      The workaround was to delete the pod and the recreation of it manages to replace the missing files.

      Attachments

        Activity

          People

            janders@redhat.com Jacob Anders
            rhn-support-mabajodu Mario Abajo Duran
            Jad Haj Yahya Jad Haj Yahya
            Mario Abajo Duran
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: