Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-122651

[RHEL-10.2][ppc64le] 2nd kernel report "page allocation failure: order:0, mode:0x0(), nodemask=(null),cpuset=/,mems_allowed=0"

Linking RHIVOS CVEs to...Migration: Automation ...RHELPRIO AssignedTeam ...SWIFT: POC ConversionSync from "Extern...XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • rhel-10.2
    • kdump-utils
    • None
    • None
    • rhel-kernel-debug
    • 0
    • False
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • Unspecified
    • Unspecified
    • Unspecified
    • ppc64le
    • None

      What were you trying to do that didn't work?

      Tested on RHEL-10.2-20251009.1 on ppc64le machine,invoke dump using panic_on_oops with the kdump default configuration(crashkernel=512M), it will report "page allocation failure: order:0, mode:0x0(), nodemask=(null),cpuset=/,mems_allowed=0" but it can generate a vmcore file.
      set "crashkernel=768M", it will not report this issue.

      [  319.252849] crasher: loading out-of-tree module taints kernel.
      [  319.252862] crasher: module verification failed: signature and/or required key missing - tainting kernel
      [  319.253304] loaded crasher module
      [  344.241782] Kernel attempted to write user page (1) - exploit attempt? (uid: 0)
      [  344.241799] BUG: Kernel NULL pointer dereference on write at 0x00000001
      [  344.241803] Faulting instruction address: 0xc0080000071700f8
      [  344.241808] Oops: Kernel access of bad area, sig: 11 [#1]
      [  344.241811] LE PAGE_SIZE=64K MMU=Hash  SMP NR_CPUS=8192 NUMA pSeries
      [  344.241817] Modules linked in: crasher(OE) bonding tls rfkill sunrpc ibmveth sg pseries_rng vmx_crypto fuse loop nfnetlink vsock_loopback vmw_vsock_virtio_transport_common vsock xfs sd_mod ibmvscsi scsi_transport_srp dm_mirror dm_region_hash dm_log dm_mod
      [  344.241848] CPU: 3 UID: 0 PID: 7375 Comm: runtest.sh Kdump: loaded Tainted: G           OE      ------  ---  6.12.0-139.el10.ppc64le #1 VOLUNTARY 
      [  344.241856] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
      [  344.241859] Hardware name: IBM,8375-42A POWER9 (architected) 0x4e0202 0xf000005 of:IBM,FW940.02 (VL940_041) hv:phyp pSeries
      [  344.241863] NIP:  c0080000071700f8 LR: c008000007170050 CTR: c0080000071700f0
      [  344.241867] REGS: c0000000834df9b0 TRAP: 0300   Tainted: G           OE      ------  ---   (6.12.0-139.el10.ppc64le)
      [  344.241872] MSR:  8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 88222481  XER: 20040000
      [  344.241884] CFAR: c00000000000db4c DAR: 0000000000000001 DSISR: 42000000 IRQMASK: 0 
      GPR00: c008000007170050 c0000000834dfc50 c008000005c38100 0000000000000000 
      GPR04: 00000100385c07d0 0000000000000001 c0000000834dfdd0 00000100385c07d0 
      GPR08: ffffffffa8aaaaaa 0000000000000000 0000000000000041 c008000007170340 
      GPR12: c000000000a4e1b0 c000000017ffcb00 0000000000000000 0000000000000000 
      GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 
      GPR20: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 
      GPR24: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 
      GPR28: 00000100385c07d0 c0000000834dfdd0 fffffffffffffffb 0000000000000002 
      [  344.241940] NIP [c0080000071700f8] crasher_write+0xf0/0x158 [crasher]
      [  344.241947] LR [c008000007170050] crasher_write+0x48/0x158 [crasher]
      [  344.241951] Call Trace:
      [  344.241953] [c0000000834dfc50] [c008000007170050] crasher_write+0x48/0x158 [crasher] (unreliable)
      [  344.241960] [c0000000834dfcd0] [c00000000083467c] proc_reg_write+0xfc/0x160
      [  344.241969] [c0000000834dfd00] [c00000000073da00] vfs_write+0x160/0x500
      [  344.241975] [c0000000834dfdb0] [c00000000073dfac] ksys_write+0x8c/0x160
      [  344.241981] [c0000000834dfe00] [c000000000030f68] system_call_exception+0x138/0x2d0
      [  344.241987] [c0000000834dfe50] [c00000000000d05c] system_call_vectored_common+0x15c/0x2ec
      [  344.241994] --- interrupt: 3000 at 0x7fff9c4f4d04
      [  344.242001] NIP:  00007fff9c4f4d04 LR: 00007fff9c4f4d04 CTR: 0000000000000000
      [  344.242005] REGS: c0000000834dfe80 TRAP: 3000   Tainted: G           OE      ------  ---   (6.12.0-139.el10.ppc64le)
      [  344.242010] MSR:  800000000280f033 <SF,VEC,VSX,EE,PR,FP,ME,IR,DR,RI,LE>  CR: 48222481  XER: 00000000
      [  344.242025] IRQMASK: 0 
      GPR00: 0000000000000004 00007fffcbddc9b0 00007fff9c5f7100 0000000000000001 
      GPR04: 00000100385c07d0 0000000000000002 0000000000000032 0000000000000001 
      GPR08: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 
      GPR12: 0000000000000000 00007fff9c70ae80 0000000111960e00 00000100388ea620 
      GPR16: 0000000111a99528 000001003858f1d0 0000000040000000 00000001119a2c40 
      GPR20: 0000000000000000 00007fffcbddcbd4 0000000000000001 0000000111a42968 
      GPR24: 0000000111a98bac 0000000111a98bb8 0000000000000000 00000100385c07d0 
      GPR28: 0000000000000002 00007fff9c5f17f0 00000100385c07d0 0000000000000002 
      [  344.242076] NIP [00007fff9c4f4d04] 0x7fff9c4f4d04
      [  344.242080] LR [00007fff9c4f4d04] 0x7fff9c4f4d04
      [  344.242084] --- interrupt: 3000
      [  344.242086] Code: 39200000 4082005c 38210080 e8010010 ebe1fff8 7c0803a6 4e800020 60000000 60000000 60000000 39200000 39400041 <99490001> 4bffffbc 0fe00000 38610064 
      [  344.242106] ---[ end trace 0000000000000000 ]---
      [  344.242668] 
      [  344.242673] Sending IPI to other CPUs
      [  344.242700] IPI complete
      [  344.263880] kexec: Starting switchover sequence.
      ....
      [Fri Oct 17 04:56:59 2025] random: crng init done
      [Fri Oct 17 04:56:59 2025] OF: fdt: Reserving 4 KiB of memory at 0x1b100000 for elfcorehdr
      [Fri Oct 17 04:56:59 2025] nr_cpu_ids was not a multiple of threads_per_core, adjusted to 8
      [Fri Oct 17 04:56:59 2025] hash-mmu: Page sizes from device-tree:
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=12: shift=12, sllp=0x0000, avpnm=0x00000000, tlbiel=1, penc=0
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=12: shift=16, sllp=0x0000, avpnm=0x00000000, tlbiel=1, penc=7
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=12: shift=24, sllp=0x0000, avpnm=0x00000000, tlbiel=1, penc=56
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=16: shift=16, sllp=0x0110, avpnm=0x00000000, tlbiel=1, penc=1
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=16: shift=24, sllp=0x0110, avpnm=0x00000000, tlbiel=1, penc=8
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=24: shift=24, sllp=0x0100, avpnm=0x00000001, tlbiel=0, penc=0
      [Fri Oct 17 04:56:59 2025] hash-mmu: base_shift=34: shift=34, sllp=0x0120, avpnm=0x000007ff, tlbiel=0, penc=3
      [Fri Oct 17 04:56:59 2025] fadump: WARNING: Could not setup area to pass additional parameters!
      [Fri Oct 17 04:56:59 2025] Enabling pkeys with max key count 31
      [Fri Oct 17 04:56:59 2025] Activating Kernel Userspace Access Prevention
      [Fri Oct 17 04:56:59 2025] Activating Kernel Userspace Execution Prevention
      [Fri Oct 17 04:56:59 2025] Page orders: linear mapping = 16, virtual = 16, io = 16, vmemmap = 16
      [Fri Oct 17 04:56:59 2025] Using 1TB segments
      [Fri Oct 17 04:56:59 2025] hash-mmu: Initializing hash mmu with SLB
      [Fri Oct 17 04:56:59 2025] Linux version 6.12.0-139.el10.ppc64le (mockbuild@dd55202d02f94949b6e97258e03c6c65) (gcc (GCC) 14.3.1 20250617 (Red Hat 14.3.1-2), GNU ld version 2.41-58.el10) #1 SMP Wed Oct  8 06:55:13 EDT 2025
      [Fri Oct 17 04:56:59 2025] Secure boot mode disabled
      [Fri Oct 17 04:56:59 2025] Found initrd at 0xc00000001b180000:0xc00000001d6f8200
      [Fri Oct 17 04:56:59 2025] Hardware name: IBM,8375-42A POWER9 (architected) 0x4e0202 0xf000005 of:IBM,FW940.02 (VL940_041) hv:phyp pSeries
      .....
      [Fri Oct 17 04:57:13 2025] sd 1:0:1:0: [sda] Attached SCSI disk
      [Fri Oct 17 04:57:15 2025] SGI XFS with ACLs, security attributes, scrub, quota, no debug enabled
      [Fri Oct 17 04:57:15 2025] XFS: attr2 mount option is deprecated.
      [Fri Oct 17 04:57:15 2025] XFS (dm-0): Mounting V5 Filesystem 069d83b9-dbd2-4c4d-83e9-4858bbe9ccf6
      [Fri Oct 17 04:57:15 2025] XFS (dm-0): Starting recovery (logdev: internal)
      [Fri Oct 17 04:57:16 2025] XFS (dm-0): Ending recovery (logdev: internal)
      [Fri Oct 17 04:57:16 2025] kworker/0:1H: page allocation failure: order:0, mode:0x0(), nodemask=(null),cpuset=/,mems_allowed=0
      [Fri Oct 17 04:57:16 2025] CPU: 0 UID: 0 PID: 207 Comm: kworker/0:1H Not tainted 6.12.0-139.el10.ppc64le #1 VOLUNTARY 
      [Fri Oct 17 04:57:16 2025] Hardware name: IBM,8375-42A POWER9 (architected) 0x4e0202 0xf000005 of:IBM,FW940.02 (VL940_041) hv:phyp pSeries
      [Fri Oct 17 04:57:16 2025] Workqueue: kblockd blk_mq_run_work_fn
      [Fri Oct 17 04:57:16 2025] Call Trace:
      [Fri Oct 17 04:57:16 2025] [c0000000230575c0] [c00000001924df84] dump_stack_lvl+0x88/0xc4 (unreliable)
      [Fri Oct 17 04:57:16 2025] [c0000000230575f0] [c00000001866aa80] warn_alloc+0x150/0x1d0
      [Fri Oct 17 04:57:16 2025] [c000000023057690] [c00000001866b350] __alloc_pages_slowpath.constprop.0+0x850/0xb20
      [Fri Oct 17 04:57:16 2025] [c0000000230577c0] [c00000001866b95c] __alloc_pages_noprof+0x33c/0x3a0
      [Fri Oct 17 04:57:16 2025] [c000000023057840] [c000000018059840] iommu_alloc_coherent+0xd0/0x2b0
      [Fri Oct 17 04:57:16 2025] [c000000023057910] [c0000000180575e8] dma_iommu_alloc_coherent+0x38/0x50
      [Fri Oct 17 04:57:16 2025] [c000000023057930] [c00000001834aa44] dma_alloc_attrs+0x2a4/0x3b0
      [Fri Oct 17 04:57:16 2025] [c0000000230579a0] [c0080000011d3ff8] map_sg_data+0x190/0x300 [ibmvscsi]
      [Fri Oct 17 04:57:16 2025] [c000000023057a40] [c0080000011d42c4] ibmvscsi_queuecommand_lck+0x15c/0x2d0 [ibmvscsi]
      [Fri Oct 17 04:57:16 2025] [c000000023057ad0] [c0080000011d447c] ibmvscsi_queuecommand+0x44/0x84 [ibmvscsi]
      [Fri Oct 17 04:57:16 2025] [c000000023057b10] [c000000018cef7ac] scsi_dispatch_cmd+0xbc/0x300
      [Fri Oct 17 04:57:16 2025] [c000000023057b90] [c000000018cf0b9c] scsi_queue_rq+0x57c/0x810
      [Fri Oct 17 04:57:16 2025] [c000000023057c30] [c0000000189a8130] blk_mq_dispatch_rq_list+0x160/0x660
      [Fri Oct 17 04:57:16 2025] [c000000023057cc0] [c0000000189b0c08] __blk_mq_do_dispatch_sched+0x458/0x480
      [Fri Oct 17 04:57:16 2025] [c000000023057d70] [c0000000189b1228] __blk_mq_sched_dispatch_requests+0x1d8/0x250
      [Fri Oct 17 04:57:16 2025] [c000000023057de0] [c0000000189b1334] blk_mq_sched_dispatch_requests+0x44/0xb0
      [Fri Oct 17 04:57:16 2025] [c000000023057e10] [c0000000189a0c28] blk_mq_run_work_fn+0x98/0xd0
      [Fri Oct 17 04:57:16 2025] [c000000023057e40] [c00000001825d8f4] process_one_work+0x1f4/0x520
      [Fri Oct 17 04:57:16 2025] [c000000023057ef0] [c00000001825f2dc] worker_thread+0x33c/0x510
      [Fri Oct 17 04:57:16 2025] [c000000023057f90] [c00000001826c124] kthread+0x154/0x170
      [Fri Oct 17 04:57:16 2025] [c000000023057fe0] [c00000001800ded8] start_kernel_thread+0x14/0x18
      [Fri Oct 17 04:57:16 2025] Mem-Info:
      [Fri Oct 17 04:57:16 2025] active_anon:988 inactive_anon:0 isolated_anon:0
                                  active_file:678 inactive_file:1640 isolated_file:0
                                  unevictable:15 dirty:172 writeback:384
                                  slab_reclaimable:153 slab_unreclaimable:1580
                                  mapped:277 shmem:405 pagetables:80
                                  sec_pagetables:0 bounce:0
                                  kernel_misc_reclaimable:0
                                  free:367 free_pcp:465 free_cma:0
      [Fri Oct 17 04:57:16 2025] Node 0 active_anon:63232kB inactive_anon:0kB active_file:43392kB inactive_file:104960kB unevictable:960kB isolated(anon):0kB isolated(file):0kB mapped:17728kB dirty:11008kB writeback:24576kB shmem:25920kB shmem_thp:0kB shmem_pmdmapped:0kB anon_thp:0kB writeback_tmp:0kB kernel_stack:2448kB pagetables:5120kB sec_pagetables:0kB all_unreclaimable? no Balloon:0kB
      [Fri Oct 17 04:57:16 2025] Node 0 Normal free:25408kB boost:16384kB min:18816kB low:19392kB high:19968kB reserved_highatomic:0KB active_anon:62016kB inactive_anon:0kB active_file:44672kB inactive_file:104576kB unevictable:960kB writepending:34688kB present:917504kB managed:420352kB mlocked:0kB bounce:0kB free_pcp:29760kB local_pcp:5312kB free_cma:0kB
      [Fri Oct 17 04:57:16 2025] lowmem_reserve[]: 0 0 0
      [Fri Oct 17 04:57:16 2025] Node 0 Normal: 174*64kB (UME) 39*128kB (UME) 14*256kB (UME) 2*512kB (ME) 1*1024kB (E) 2*2048kB (E) 0*4096kB 0*8192kB 0*16384kB = 25856kB
      [Fri Oct 17 04:57:16 2025] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=16384kB
      [Fri Oct 17 04:57:16 2025] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=16777216kB
      [Fri Oct 17 04:57:16 2025] 2752 total pagecache pages
      [Fri Oct 17 04:57:16 2025] 0 pages in swap cache
      [Fri Oct 17 04:57:16 2025] Free swap  = 0kB
      [Fri Oct 17 04:57:16 2025] Total swap = 0kB
      [Fri Oct 17 04:57:16 2025] 14336 pages RAM
      [Fri Oct 17 04:57:16 2025] 0 pages HighMem/MovableOnly
      [Fri Oct 17 04:57:16 2025] 7768 pages reserved
      [Fri Oct 17 04:57:16 2025] 0 pages cma reserved
      [Fri Oct 17 04:57:16 2025] 0 pages hwpoisoned
      [Fri Oct 17 04:57:16 2025] sd 1:0:1:0: Can't allocate memory for indirect table
      .....
      pivot and cleanup hook.
      Oct 17 04:57:16 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com systemd[1]: Starting kdump-capture.service - Kdump Vmcore Save Service...
      Oct 17 04:57:16 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump[554]: Kdump is using the default log level(3).
      Oct 17 04:57:16 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump[598]: saving to /sysroot/var/crash/127.0.0.1-2025-10-17-04:57:16/
      Oct 17 04:57:16 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump[603]: saving vmcore-dmesg.txt to /sysroot/var/crash/127.0.0.1-2025-10-17-04:57:16/
      Oct 17 04:57:16 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump[609]: saving vmcore-dmesg.txt complete
      Oct 17 04:57:16 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump[611]: saving vmcore
      Oct 17 04:57:17 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump.sh[612]: 
      Checking for memory holes                         : [  0.0 %] /                  
      Checking for memory holes                         : [100.0 %] |                  
      Excluding unnecessary pages                       : [100.0 %] \                  
      Copying data                                      : [ 70.9 %] -           eta: 0s
      Copying data                                      : [100.0 %] /           eta: 0s
      Oct 17 04:57:17 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump.sh[612]: 
      Copying data                                      : [100.0 %] |           eta: 0s
      Oct 17 04:57:17 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump.sh[612]: The dumpfile is saved to /sysroot/var/crash/127.0.0.1-2025-10-17-04:57:16//vmcore-incomplete.
      Oct 17 04:57:17 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump.sh[612]: makedumpfile Completed.
      Oct 17 04:57:17 ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com kdump[625]: saving vmcore complete
      
      

      beaker job: https://beaker.engineering.redhat.com/jobs/11798857

      Please provide the package NVR for which the bug is seen:

      RHEL-10.2-20251009.1
      kernel-6.12.0-139.el10
      kdump-utils-1.0.54-7.el10
      kexec-tools-2.0.31-1.el10

      How reproducible is this bug?:

      repeatedly run 10 times on a machine, it will be reproduced at least 2 times.
      it is easy to reproduce this issue:
      ibm-p9z-25-lp3.virt.pnr.lab.eng.rdu2.redhat.com
      ibm-p9z-25-lp8.virt.pnr.lab.eng.rdu2.redhat.com
      ibm-p9z-25-lp9.virt.pnr.lab.eng.rdu2.redhat.com
      ibm-p9z-25-lp6.virt.pnr.lab.eng.rdu2.redhat.com

      Steps to reproduce

      1.  install RHEL-10.2-20251009.1 on ppc64le machine 
      2.  Make module crasher:
          # get crash-crasher code from : https://gitlab.com/redhat/centos-stream/tests/kernel/kernel-tests/-/tree/main/kdump/crash-crasher?ref_type=heads 
           # make -C crasher
           # insmod crasher/crasher.ko
      3. Enable panic_on_oops : 
           #  echo 1 > /proc/sys/kernel/panic_on_oops
      4.  Trigger crasher panic:
           #  echo "2" > /proc/crasher
      

      Expected results

      Actual results

              piliu@redhat.com Pingfan Liu
              rhn-support-yiyan Xiaoying Yan
              kdump team kdump team
              Xiaoying Yan Xiaoying Yan
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated: