Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-1946

9.3 nightly images fail to poweroff: Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000000

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Undefined Undefined
    • None
    • rhel-9.3.0
    • kernel / Kernel-Core
    • Yes
    • Critical
    • Regression
    • rhel-sst-kernel-ft
    • ssg_core_kernel
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • Rejected Blocker
    • None
    • None
    • x86_64
    • None

      What were you trying to do that didn't work?

      Boot current cloud image and try to poweroff

      Please provide the package NVR for which bug is seen:

      kernel-5.14.0-362.el9.x86_64

      How reproducible: Always

      Steps to reproduce

      1. Boot http://download.devel.redhat.com/rhel-9/nightly/RHEL-9/latest-RHEL-9.3/compose/BaseOS/x86_64/images/rhel-guest-image-9.3-20230901.0.x86_64.qcow2 or https://cloud.centos.org/centos/9-stream/x86_64/images/CentOS-Stream-GenericCloud-9-latest.x86_64.qcow2 with some suitable cloud-init.iso; you are welcome to use cockpit CI's which provides root:foobar and admin:foobar accounts, but it's not magic in any way. You just need some cloud-init data to be able to log in at all.
      2. poweroff

      Expected results

      Machine powers off.

      Actual results

      Kernel oopses during late shutdown, complaining that init gets killed. Well yes, that's what shutdown does..

      [  OK  ] Reached target System Shutdown.    
      [  OK  ] Reached target Late Shutdown Services.    
      [  OK  ] Finished System Power Off.    
      [  OK  ] Reached target System Power Off.    
      [...]
      [  136.473094] systemd-shutdown[1]: Sending SIGKILL to remaining processes...    
      [  136.474582] systemd-shutdown[1]: Unmounting file systems.    
      [  136.474897] [4117]: Remounting '/' read-only with options 'seclabel,attr2,inode64,logbufs=8,logbsize=32k,noquota'.    
      [  136.479713] systemd-shutdown[1]: All filesystems unmounted.    
      [  136.485781] systemd-shutdown[1]: Powering off.    
      [  136.486908] ACPI: PM: Preparing to enter system sleep state S5    
      [  136.487425] kvm: exiting hardware virtualization    
      [  136.487730] reboot: Power down    
      [  136.488059] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000000    
      [  136.488581] CPU: 0 PID: 1 Comm: systemd-shutdow Kdump: loaded Not tainted 5.14.0-362.el9.x86_64 #1    
      [  136.489174] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-1.fc38 04/01/2014    
      [  136.489722] Call Trace:    
      [  136.490266]  <TASK>    
      [  136.490783]  dump_stack_lvl+0x34/0x48    
      [  136.491369]  panic+0xea/0x2e4    
      [  136.491860]  do_exit.cold+0x15/0x15    
      [  136.492411]  __do_sys_reboot+0x1e5/0x220    
      [  136.492952]  ? do_iter_readv_writev+0x124/0x190    
      [  136.493533]  ? do_iter_write+0x81/0x150    
      [  136.494065]  ? vfs_writev+0xcb/0x170    
      [  136.494570]  do_syscall_64+0x59/0x90    
      [  136.495064]  ? __x64_sys_rt_sigprocmask+0x76/0xd0    
      [  136.495611]  ? do_writev+0x6f/0x120    
      [  136.496069]  ? syscall_exit_to_user_mode+0x12/0x30    
      [  136.496607]  ? do_syscall_64+0x69/0x90    
      [  136.497067]  ? exc_page_fault+0x62/0x150    
      [  136.497539]  entry_SYSCALL_64_after_hwframe+0x72/0xdc    
      [  136.498083] RIP: 0033:0x7f9a8a3453e7    
      [  136.498524] Code: 0b 00 f7 d8 64 89 02 b8 ff ff ff ff eb b8 0f 1f 44 00 00 f3 0f 1e fa 89 fa be 69 19 12 28 bf ad de e1 fe b8 a9 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 01 c3 48 8b 15 09 4a 0b 00 f7 d8 64 89 02 b8    
      [  136.500059] RSP: 002b:00007ffcc34c4f38 EFLAGS: 00000202 ORIG_RAX: 00000000000000a9    
      [  136.500743] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f9a8a3453e7    
      [  136.501421] RDX: 000000004321fedc RSI: 0000000028121969 RDI: 00000000fee1dead    
      [  136.502087] RBP: 00007ffcc34c5140 R08: 0000000000000000 R09: 00007ffcc34c4330    
      [  136.502738] R10: 00007ffcc34c44f0 R11: 0000000000000202 R12: 0000000000000000    
      [  136.503360] R13: 00007ffcc34c4f88 R14: 00007ffcc34c4f90 R15: 0000000000000000
      [  136.503992]  </TASK>
      

      After that, it immediately reboots into a kdump boot:

      [    0.000000] Linux version 5.14.0-362.el9.x86_64 (mockbuild@x86-64-01.build.eng.rdu2.redhat.com) (gcc (GCC) 11.4.1 20230605 (Red Hat 11.4.1-2), GNU ld version 2.35.2-42.el9) #1 SMP PREEMPT_DYNAMIC Mon Aug 28 18:48:22 EDT 2023
      [...]
      [    1.817772] kdump[382]: Kdump is using the default log level(3).
      [    1.846962] kdump[418]: saving to /sysroot/var/crash/127.0.0.1-2023-09-01-02:07:42/
      [    1.852206] kdump[423]: saving vmcore-dmesg.txt to /sysroot/var/crash/127.0.0.1-2023-09-01-02:07:42/
      [    1.862219] kdump[429]: saving vmcore-dmesg.txt complete
      [    1.864143] kdump[431]: saving vmcore
      Copying data                                      : [100.0 %] /           eta: 0s
      [    2.127216] kdump.sh[432]: The dumpfile is saved to /sysroot/var/crash/127.0.0.1-2023-09-01-02:07:42//vmcore-incomplete.
      [    2.128155] kdump.sh[432]: makedumpfile Completed.
      [    2.136409] kdump[436]: saving vmcore complete
      [    2.138384] kdump[438]: saving the /run/initramfs/kexec-dmesg.log to /sysroot/var/crash/127.0.0.1-2023-09-01-02:07:42//
      [    2.153255] kdump[444]: Executing final action systemctl reboot -f
      

      .. and eventually ends up again in a booted system:

        OK  ] Finished Crash recovery kernel arming.
      
      Red Hat Enterprise Linux 9.3 Beta (Plow)
      Kernel 5.14.0-362.el9.x86_64 on an x86_64
      
      Activate the web console with: systemctl enable --now cockpit.socket
      
      rhel-9-3-127-0-0-2-2201 login:
      

      See https://github.com/cockpit-project/bots/issues/5169 – this started somewhere between rhel-guest-image-9.3-20230817.2.x86_64.qcow2 from roughly two weeks ago (which was fine), and rhel-guest-image-9.3-20230828.65.x86_64.qcow2 from 3 days ago (which was broken).

              core-kernel-mgr core-kernel mgr
              rhn-engineering-mpitt Martin Pitt
              core-kernel mgr core-kernel mgr
              Kernel QE Kernel QE
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: