Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-118803

[Task make test] Kernel crash in mdadm due to invalid bitmap_get_stats.

Linking RHIVOS CVEs to...Migration: Automation ...SWIFT: POC ConversionSync from "Extern...XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • None
    • rhel-9.6
    • mdadm
    • None
    • Yes
    • Low
    • rhel-storage-crs
    • None
    • False
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • Unspecified
    • Unspecified
    • Unspecified
    • x86_64
    • None

      What were you trying to do that didn't work?

      RHEL 9.6 fails to boot and the kernel crashes when using "Intel VROC RAID."

      What is the impact of this issue to you?

      System not booting.

      Please provide the package NVR for which the bug is seen:

      kernel-5.14.0-570.19.1.el9_6

      How reproducible is this bug?:

      Always

      Steps to reproduce

      1. Install system with  Intel VROC" RAID using <RHEL9.6 kernel.
      2. Update kernel to RHEL9.6
      3. Booting RHEL9.6 kernel crash with kernel panic. 

      Expected results

      System should boot with RHEL9.6 kernel.

      Actual results

      System crashed with following call traces.
      Stack trace:
      16.773448] ice 0000:ab:00.0: RDMA is not supported on this device
      [ 16.778065] block device autoloading is deprecated and will be removed.
      [ 16.858515] md/raid10:md124: active with 4 out of 4 devices
      [ 16.860992] general protection fault, probably for non-canonical address 0x1d93161180000028: 0000 1 PREEMPT SMP NOPTI
      [ 16.860995] CPU: 29 PID: 1362 Comm: mdmon Tainted: G OE ------- — 5.14.0-570.44.1.el9_6.x86_64 #1
      [ 16.860997] Hardware name: Jabil J322-S/EGS 2S MB1, BIOS a3008h 07/11/2024
      [ 16.860998] RIP: 0010:bitmap_get_stats+0x27/0x90
      [ 16.861004] Code: 90 90 90 0f 1f 44 00 00 48 89 f2 48 85 ff 74 75 48 8b 4f 50 48 2b 0d c0 ff d9 00 48 8b 35 c9 ff d9 00 48 c1 f9 06 48 c1 e1 0c <48> 8b 4c 31 28 48 89 4a 20 48 8b 4f 18 48 89 4a 10 48 8b 4f 10 48
      [ 16.861006] RSP: 0018:ff278c6d5e0e7c50 EFLAGS: 00010206
      [ 16.861007] RAX: ffffffff977cb360 RBX: ff1788d38032d428 RCX: 1e7b8d4000000000
      [ 16.861009] RDX: ff278c6d5e0e7c60 RSI: ff1788d180000000 RDI: ff1788d48cb26400
      [ 16.861009] RBP: ff1788d294f608e8 R08: 0000000000000001 R09: ff1788d391ce00c3
      [ 16.861010] R10: ffffffff98533d73 R11: 0000000000000000 R12: 0000000df8f9a000
      [ 16.861011] R13: ff1788d38032d018 R14: ff1788d38032d000 R15: ff1788d38032d018
      [ 16.861012] FS: 00007fa73caf2740(0000) GS:ff17894ffff40000(0000) knlGS:0000000000000000
      [ 16.861013] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [ 16.861014] CR2: 00007f936ce43018 CR3: 0000008091ab2006 CR4: 0000000000771ef0
      [ 16.861015] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      [ 16.861015] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
      [ 16.861016] PKRU: 55555554
      [ 16.861017] Call Trace:
      [ 16.861018] <TASK>
      [ 16.861019] ? show_trace_log_lvl+0x1c4/0x2df
      [ 16.861023] ? show_trace_log_lvl+0x1c4/0x2df
      [ 16.861025] ? md_seq_show+0x2c3/0x580
      [ 16.861028] ? __die_body.cold+0x8/0xd
      [ 16.861029] ? die_addr+0x39/0x60
      [ 16.861034] ? exc_general_protection+0x1ec/0x420
      [ 16.861038] ? asm_exc_general_protection+0x22/0x30
      [ 16.861043] ? __pfx_bitmap_get_stats+0x10/0x10
      [ 16.861045] ? bitmap_get_stats+0x27/0x90
      [ 16.861046] md_seq_show+0x2c3/0x580
      [ 16.861048] ? set_close_on_exec+0x2e/0x70
      [ 16.861052] seq_read_iter+0x2c3/0x4b0
      [ 16.861055] seq_read+0x146/0x190
      [ 16.861057] proc_reg_read+0x53/0xa0
      [ 16.861061] vfs_read+0xab/0x3b0
      [ 16.861065] ? seq_release+0x25/0x30
      [ 16.861067] ? kmem_cache_free+0x3f1/0x420
      [ 16.861070] ? __fget_light+0x9f/0x130
      [ 16.861073] ks
      [ 16.861073] ksys_read+0x5f/0xe0
      [ 16.861075] do_syscall_64+0x5c/0xe0
      [ 16.861077] ? syscall_exit_to_user_mode+0x19/0x40
      [ 16.861080] ? do_syscall_64+0x6b/0xe0
      [ 16.861081] ? syscall_exit_to_user_mode+0x19/0x40
      [ 16.861082] ? do_syscall_64+0x6b/0xe0
      [ 16.861083] ? syscall_exit_to_user_mode+0x19/0x40
      [ 16.861084] ? do_syscall_64+0x6b/0xe0
      [ 16.861085] ? do_syscall_64+0x6b/0xe0
      [ 16.861086] ? do_syscall_64+0x6b/0xe0
      [ 16.861087] entry_SYSCALL_64_after_hwframe+0x78/0x80
      [ 16.861089] RIP: 0033:0x7fa73cbf2ffc
      [ 16.861091] Code: ec 28 48 89 54 24 18 48 89 74 24 10 89 7c 24 08 e8 e9 8a f8 ff 48 8b 54 24 18 48 8b 74 24 10 41 89 c0 8b 7c 24 08 31 c0 0f 05 <48> 3d 00 f0 ff ff 77 34 44 89 c7 48 89 44 24 08 e8 3f 8b f8 ff 48
      [ 16.861092] RSP: 002b:00007fff0d371410 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
      [ 16.861093] RAX: ffffffffffffffda RBX: 000055c4e7559350 RCX: 00007fa73cbf2ffc
      [ 16.861094] RDX: 0000000000000400 RSI: 000055c4e7557000 RDI: 0000000000000015
      [ 16.861095] RBP: 00007fa73cceb5e0 R08: 0000000000000000 R09: 00007fff0d3715b0
      [ 16.861095] R10: 00007fff0d371780 R11: 0000000000000246 R12: 0000000000000000
      [ 16.861096] R13: 0000000000000d68 R14: 00007fa73ccea9e0 R15: 0000000000000d68
      [ 16.861097] </TASK>
      [ 16.861098] Modules linked in: raid10 raid1 nvme ahci ice crct10dif_pclmul igb libahci nvme_core crc32_pclmul sfc(OE) i2c_algo_bit libie libata ghash_clmulni_intel mtd gnss nvme_auth wmi dca pinctrl_emmitsburg br_netfilter xfs bridge libcrc32c stp crc32c_intel llc overlay onload(OE) dm_multipath sfc_char(OE) dm_mirror dm_region_hash sfc_resource(OE) dm_log sfc_driverlink(OE) dm_mod fuse
      [ 16.862721] md124: detected capacity change from 0 to 30005841920
      [ 16.866190] --[ end trace 0000000000000000 ]--
      [ 16.910524] RIP: 0010:bitmap_get_stats+0x27/0x90
      [ 17.003820] ice 0000:ab:00.1: RDMA functionality is not available with the current device configuration.
      [ 17.005149] Code: 90 90 90 0f 1f 44 00 00 48 89 f2 48 85 ff 74 75 48 8b 4f 50 48 2b 0d c0 ff d9 00 48 8b 35 c9 ff d9 00 48 c1 f9 06 48 c1 e1 0c <48> 8b 4c 31 28 48 89 4a 20 48 8b 4f 18 48 89 4a 10 48 8b 4f 10 48
      [ 17.038435] ice 0000:ab:00.1: DDP package already present on device: ICE OS Default Package version 1.3.43.0
      [ 17.040024] RSP: 0018:ff278c6d5e0e7c50 EFLAGS: 00010206
      [ 17.101955] RAX: ffffffff977cb360 RBX: ff1788d38032d428 RCX: 1e7b8d4000000000
      [ 17.105152] RDX: ff278c6d5e0e7c60 RSI: ff1788d180000000 RDI: ff1788d48cb26400
      [ 17.108353] RBP: ff1788d294f608e8 R08: 0000000000000001 R09: ff1788d391ce00c3
      [ 17.111561] R10: ffffffff98533d73 R11: 0000000000000000 R12: 0000000df8f9a000
      [ 17.114776] R13: ff1788d38032d018 R14: ff1788d38032d000 R15: ff1788d38032d018
      [ 17.118007] FS: 00007fa73caf2740(0000) GS:ff17894ffff40000(0000) knlGS:0000000000000000
      [ 17.120817] ice 0000:ab:00.1: 252.048 Gb/s available PCIe bandwidth (16.0 GT/s PCIe x16 link)
      [ 17.121312] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [ 17.128113] CR2: 00007f936ce43018 CR3: 0000008091ab2006 CR4: 0000000000771ef0
      [ 17.131650] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      [ 17.135180] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
      [ 17.138578] ice 0000:ab:00.1: PTP init successful
      [ 17.138672] PKRU: 55555554
      [ 17.145442] Kernel panic - not syncing: Fatal exception
      [ 18.509786] Shutting down cpus with NMI
      [ 18.514205] Kernel Offset: 0x15e00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
      [ 18.585786] --[ end Kernel panic - not syncing: Fatal exception ]--

              fan.fan Fan Fan
              rhn-support-rmadhuso Ranjith ML
              Nigel Croxon Nigel Croxon
              storage-qe storage-qe
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated: