AI Platform Core Components / AIPCC-866

RHEL AI 1.4.3 release bootc containers result in kernel panic

    • Type: Bug
    • Resolution: Duplicate
    • AIPCC Productization

      To Reproduce

      Steps to reproduce the behavior:

      1. Deploy RHEL AI 1.4 onto AWS
      2. Run either bootc upgrade --apply or bootc switch "registry.redhat.io/rhelai1/bootc-aws-nvidia-rhel9:1.4.3-1743101756"
      3. Observe kernel panic after reboot
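
      The two upgrade paths in step 2 can be written down as a small dry-run helper, which may be handy when scripting the reproduction across instance types. This is only a sketch: the function name repro_commands is hypothetical, and the explicit reboot after bootc switch is an assumption, since the report only says the panic appears "after reboot".

```python
import shlex

# Image reference copied verbatim from step 2 of the report.
IMAGE = "registry.redhat.io/rhelai1/bootc-aws-nvidia-rhel9:1.4.3-1743101756"


def repro_commands(use_switch: bool) -> list[list[str]]:
    """Return the argv lists for one of the two upgrade paths in step 2.

    Hypothetical helper: it only builds the commands and never runs them.
    """
    if use_switch:
        # Assumption: the switch is followed by a manual reboot.
        return [["bootc", "switch", IMAGE], ["systemctl", "reboot"]]
    # `bootc upgrade --apply` is given in the report as a single step.
    return [["bootc", "upgrade", "--apply"]]


if __name__ == "__main__":
    # Print both variants as shell-ready command lines.
    for use_switch in (False, True):
        print(" && ".join(shlex.join(cmd) for cmd in repro_commands(use_switch)))
```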

      Expected behavior

      • The system boots into the upgraded bootc image

      Screenshots

      • N/A

      Device Info (please complete the following information):

      • Hardware Specs: AWS (verified on g6 and p5)
      • OS Version: RHEL AI 1.4.3 Prod/Release
      • InstructLab Version: 

      Bug impact

      • System is rendered unusable

      Known workaround

      • N/A

      Additional context

      Stage builds worked, and still work, without problems. So far the issue has been verified only on the AWS prod/release bootc containers.

      [  110.320291] md: Waiting for all devices to be available before autodetect
      [  110.348731] md: If you don't use raid, use raid=noautodetect
      [  110.375634] md: Autodetecting RAID arrays.
      [  110.394654] md: autorun ...
      [  110.407907] md: ... autorun DONE.
      [  110.423840] List of all partitions:
      [  110.440632] No filesystem could mount root, tried: 
      [  110.440633] 
      [  110.470689] Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)
      [  110.512464] CPU: 8 PID: 1 Comm: swapper/0 Not tainted 5.14.0-427.55.1.el9_4.x86_64 #1
      [  110.550228] Hardware name: Amazon EC2 g6.48xlarge/, BIOS 1.0 10/16/2017
      [  110.580257] Call Trace:
      [  110.591320]  <TASK>
      [  110.601471]  dump_stack_lvl+0x34/0x48
      [  110.619104]  panic+0x107/0x2f7
      [  110.633671]  mount_root_generic+0x1bc/0x1cf
      [  110.654074]  ? create_io_worker+0x140/0x1a0
      [  110.673921]  ? mount_root+0x134/0x176
      [  110.691122]  prepare_namespace+0x1ae/0x1f5
      [  110.711060]  kernel_init_freeable+0x186/0x1ab
      [  110.731779]  ? __pfx_kernel_init+0x10/0x10
      [  110.751160]  kernel_init+0x16/0x130
      [  110.768017]  ret_from_fork+0x2c/0x50
      [  110.785215]  </TASK>
      [  110.799139] Kernel Offset: 0x2da00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
      [  110.849722] ---[ end Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0) ]---
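
      When comparing console logs from several instances (for example g6 and p5), the key facts in a trace like the one above can be pulled out automatically. The following is a minimal sketch, not a diagnostic tool: the function name summarize_panic and the regular expressions are assumptions based only on the line formats visible in this log.

```python
import re

# Patterns derived from the line formats seen in the console log above.
PANIC_RE = re.compile(r"Kernel panic - not syncing: (?P<reason>.+)")
KERNEL_RE = re.compile(r"(?P<kernel>\d+\.\d+\.\d+-\S+) #\d+")
HARDWARE_RE = re.compile(r"Hardware name: (?P<hardware>[^,]+)")

PATTERNS = (("reason", PANIC_RE), ("kernel", KERNEL_RE), ("hardware", HARDWARE_RE))


def summarize_panic(console_log: str) -> dict[str, str]:
    """Extract panic reason, kernel release and hardware name from a console log.

    Only the first match for each field is kept, so the trailing
    "---[ end Kernel panic ... ]---" line does not overwrite the reason.
    """
    summary: dict[str, str] = {}
    for line in console_log.splitlines():
        for key, pattern in PATTERNS:
            match = pattern.search(line)
            if match and key not in summary:
                # Drop surrounding whitespace and the trailing "/" EC2 appends.
                summary[key] = match.group(key).strip().rstrip("/")
    return summary


if __name__ == "__main__":
    sample = "\n".join([
        "[  110.470689] Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)",
        "[  110.512464] CPU: 8 PID: 1 Comm: swapper/0 Not tainted 5.14.0-427.55.1.el9_4.x86_64 #1",
        "[  110.550228] Hardware name: Amazon EC2 g6.48xlarge/, BIOS 1.0 10/16/2017",
    ])
    for field, value in summarize_panic(sample).items():
        print(f"{field}: {value}")
```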

    • Assignee: Unassigned
    • Reporter: František Zatloukal (fzatlouk@redhat.com)