  1. OpenShift Bugs
  2. OCPBUGS-48793

Telco : Toolbox fails with open pidfd: Invalid argument and exits with code 129 when run through a debug pod


    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • 4.18.z
    • Containers
    • Quality / Stability / Reliability
    • False
    • Yes

      Description of problem:

      When using the toolbox, the process fails with the following log entries:
      2025-01-21T17:36:02.443670Z: open pidfd: Invalid argument  
      /usr/bin/toolbox: line 179: 874734 Hangup sudo podman attach "${TOOLBOX_NAME}"  
      --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_KILLED, si_pid=874734, si_uid=0, si_status=SIGHUP, si_utime=0, si_stime=1} ---  
      --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=995195, si_uid=0, si_status=0, si_utime=0, si_stime=0} ---  
      +++ exited with 129 +++  
      
      The failure appears to be tied to the "open pidfd: Invalid argument" error. Right after it is logged, the "sudo podman attach" child (PID 874734) is killed by SIGHUP, and toolbox exits with code 129, which is 128 plus the signal number of SIGHUP (1).

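
      For triage, a minimal Python sketch of how a shell-style exit status such as 129 maps back to a signal. It assumes the 128 + N convention that bash uses for a child terminated by signal N; the function name describe_exit_status is hypothetical, and a program can also return a value above 128 on its own, so the result is a hint rather than proof.

```python
import signal


def describe_exit_status(status: int) -> str:
    """Interpret a shell-style exit status (assumes the 128 + N signal convention)."""
    if status > 128:
        signum = status - 128
        try:
            # signal.Signals maps a signal number to its symbolic name.
            return f"killed by {signal.Signals(signum).name}"
        except ValueError:
            return f"killed by unknown signal {signum}"
    return f"exited normally with code {status}"


if __name__ == "__main__":
    # 129 is the exit code reported in the log above.
    print(describe_exit_status(129))
```

      On Linux this reports SIGHUP for 129, which matches the si_status=SIGHUP field in the SIGCHLD line of the trace.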
      Steps to Reproduce:
      
      
      Expected Result:
      
      The toolbox should complete its operation without errors or premature termination.
      Actual Result:
      
      The process fails with the open pidfd: Invalid argument error and exits with code 129.
      Environment:
      
      Toolbox version: harbor.jointlab.fx.nsn-rdnet.net:443/ocp/images/rhel9/support-tools:latest 
      Sos version: sos report (version 4.8.1)
      Podman version: [Specify version]
      Operating System: Red Hat Enterprise Linux CoreOS 418.94.202412120450-0
      Kernel version: 5.14.0-427.49.1.el9_4.x86_64
      
      Additional Information:
      
      The issue does not reproduce when the same steps are run over an SSH connection to the node instead of through a debug pod.
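
      To compare the two environments, a small probe can be run once inside the debug pod session and once over SSH. The sketch below is only a diagnostic aid: it assumes the logged message comes from a failed pidfd_open(2) call, which this report does not confirm, and it assumes Python 3.9 or later is available where it runs. The function name probe_pidfd is hypothetical.

```python
import errno
import os


def probe_pidfd(pid: int) -> str:
    """Try to open a pidfd for `pid` and summarize the outcome as a string."""
    if not hasattr(os, "pidfd_open"):
        # os.pidfd_open exists only on Linux with Python 3.9+.
        return "unsupported: os.pidfd_open is not available in this Python build"
    try:
        fd = os.pidfd_open(pid)
    except OSError as exc:
        name = errno.errorcode.get(exc.errno, str(exc.errno))
        return f"error: {name} ({exc.strerror})"
    os.close(fd)
    return "ok"


if __name__ == "__main__":
    # Probe the current process and its parent as seen from this PID namespace.
    print("self:  ", probe_pidfd(os.getpid()))
    print("parent:", probe_pidfd(os.getppid()))
```

      A differing result between the two sessions would narrow the problem to the debug pod's process context. Identical results would point elsewhere.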
      
      
      Attachments:
      
      Full logs (if available)
      Environment details or configuration files
      
      

      Version-Release number of selected component (if applicable):

          4.18.0-rc.2 / Red Hat Enterprise Linux CoreOS 418.94.202412120450-0 / 5.14.0-427.49.1.el9_4.x86_64

      How reproducible:

      100%

      Steps to Reproduce:

          1. $ oc debug node/master0
          2. # chroot /host
          3. # toolbox
          4. # sos report --all-logs --batch --alloptions --debug
          5. Observe the logs during execution. The following line appears: 2025-01-21T17:36:02.443670Z: open pidfd: Invalid argument

      Actual results:

          2025-01-21T17:36:02.443670Z: open pidfd: Invalid argument

      Expected results:

          The container is not killed and the sos archive is created.

      Additional info:

          

              Tom Sweeney (tsweeney@redhat.com)
              Mihai IDU (midu@redhat.com)
              Cameron Meadors