Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-6495

[e1000e] Intel 82574L Reset adapter unexpectedly

    • Major
    • sst_network_drivers
    • ssg_networking
    • False
    • Hide

      None

      Show
      None
    • If docs needed, set a value

      Hi:
      I upgrade several servers from RHEL7/RHEL8 to RHEL9.0/9.1. all of their 82574L nics will reset from time to time. at first I think maybe the nic is falling. but now every 82574L including onboard or pci-card behave the same way. these servers are working fine under RHEL7/8. hope they can survive under RHEL9. there maybe similar bug #1939009

      the reset message is like below:

      [Mon Feb 13 08:39:52 2023] -----------[ cut here ]-----------
      [Mon Feb 13 08:39:52 2023] NETDEV WATCHDOG: enp7s0 (e1000e): transmit queue 0 timed out
      [Mon Feb 13 08:39:52 2023] WARNING: CPU: 21 PID: 0 at net/sched/sch_generic.c:529 dev_watchdog+0x1f9/0x200
      [Mon Feb 13 08:39:52 2023] Modules linked in: vhost_net vhost vhost_iotlb tap tun drbd_transport_tcp(OE) drbd(OE) bonding ib_uverbs ib_core mlx4_en(OE) tls bridge stp llc rfkill intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass ipmi_ssif iTCO_wdt rapl iTCO_vendor_support mlx4_core(OE) acpi_ipmi sunrpc intel_cstate ipmi_si mei_me i2c_i801 e1000e joydev pcspkr mei ipmi_devintf i2c_smbus mxm_wmi intel_uncore ioatdma ipmi_msghandler lpc_ich dca fuse xfs raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx raid6_pq raid1 libcrc32c sd_mod t10_pi sg mgag200 i2c_algo_bit drm_shmem_helper drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops isci(OE) libsas drm ahci scsi_transport_sas libahci crct10dif_pclmul crc32_pclmul crc32c_intel libata ghash_clmulni_intel wmi dm_mirror dm_region_hash dm_log dm_mod
      [Mon Feb 13 08:39:52 2023] CPU: 21 PID: 0 Comm: swapper/21 Tainted: G OE --------- — 5.14.0-162.12.1.el9_1.x86_64 #1
      [Mon Feb 13 08:39:52 2023] Hardware name: Supermicro X9DRL-3F/iF/X9DRL-3F/iF, BIOS 3.3 01/30/2019
      [Mon Feb 13 08:39:52 2023] RIP: 0010:dev_watchdog+0x1f9/0x200
      [Mon Feb 13 08:39:52 2023] Code: 00 e9 40 ff ff ff 48 89 ef c6 05 7f 3d 68 01 01 e8 cc af f9 ff 44 89 e9 48 89 ee 48 c7 c7 e0 49 e0 b5 48 89 c2 e8 bf fd 17 00 <0f> 0b e9 22 ff ff ff 0f 1f 44 00 00 41 54 55 53 48 89 fb 48 8b 6f
      [Mon Feb 13 08:39:52 2023] RSP: 0018:ffffa69883758eb0 EFLAGS: 00010282
      [Mon Feb 13 08:39:52 2023] RAX: 0000000000000000 RBX: ffff90b9c5608480 RCX: 000000000000083f
      [Mon Feb 13 08:39:52 2023] RDX: 0000000000000000 RSI: 00000000000000f6 RDI: 000000000000003f
      [Mon Feb 13 08:39:52 2023] RBP: ffff90b9c5608000 R08: 0000000000000000 R09: ffffa69883758cf0
      [Mon Feb 13 08:39:52 2023] R10: ffffa69883758ce8 R11: ffffffffb67e9128 R12: ffff90b9c56083dc
      [Mon Feb 13 08:39:52 2023] R13: 0000000000000000 R14: ffffffffb530ca10 R15: ffff90bd2fd5a440
      [Mon Feb 13 08:39:52 2023] FS: 0000000000000000(0000) GS:ffff90bd2fd40000(0000) knlGS:0000000000000000
      [Mon Feb 13 08:39:52 2023] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [Mon Feb 13 08:39:52 2023] CR2: 00007f573f136000 CR3: 0000000211210002 CR4: 00000000001726e0
      [Mon Feb 13 08:39:52 2023] Call Trace:
      [Mon Feb 13 08:39:52 2023] <IRQ>
      [Mon Feb 13 08:39:52 2023] ? dequeue_skb+0x500/0x500
      [Mon Feb 13 08:39:52 2023] call_timer_fn+0x24/0x130
      [Mon Feb 13 08:39:52 2023] __run_timers.part.0+0x1cc/0x270
      [Mon Feb 13 08:39:52 2023] ? __hrtimer_run_queues+0x139/0x2c0
      [Mon Feb 13 08:39:52 2023] ? ktime_get+0x35/0xa0
      [Mon Feb 13 08:39:52 2023] run_timer_softirq+0x26/0x50
      [Mon Feb 13 08:39:52 2023] __do_softirq+0xc7/0x2ac
      [Mon Feb 13 08:39:52 2023] __irq_exit_rcu+0xb5/0xe0
      [Mon Feb 13 08:39:52 2023] sysvec_apic_timer_interrupt+0x72/0x90
      [Mon Feb 13 08:39:52 2023] </IRQ>
      [Mon Feb 13 08:39:52 2023] asm_sysvec_apic_timer_interrupt+0x16/0x20
      [Mon Feb 13 08:39:52 2023] RIP: 0010:cpuidle_enter_state+0xd2/0x360
      [Mon Feb 13 08:39:52 2023] Code: 49 89 c5 0f 1f 44 00 00 31 ff e8 19 84 92 ff 45 84 ff 74 12 9c 58 f6 c4 02 0f 85 75 02 00 00 31 ff e8 e2 74 98 ff fb 45 85 f6 <0f> 88 15 01 00 00 49 63 d6 4c 2b 2c 24 48 8d 04 52 48 8d 04 82 49
      [Mon Feb 13 08:39:52 2023] RSP: 0018:ffffa69883353e98 EFLAGS: 00000202
      [Mon Feb 13 08:39:52 2023] RAX: ffff90bd2fd6af40 RBX: 0000000000000002 RCX: 000000000000001f
      [Mon Feb 13 08:39:52 2023] RDX: 0000000000000000 RSI: 00000000313b14ef RDI: 0000000000000000
      [Mon Feb 13 08:39:52 2023] RBP: ffffc6947fd40370 R08: 000018a15ac535bb R09: 0000000000000001
      [Mon Feb 13 08:39:52 2023] R10: 0000000000032c6f R11: 0000000000058562 R12: ffffffffb68e6c60
      [Mon Feb 13 08:39:52 2023] R13: 000018a15ac535bb R14: 0000000000000002 R15: 0000000000000000
      [Mon Feb 13 08:39:52 2023] ? cpuidle_enter_state+0xb7/0x360
      [Mon Feb 13 08:39:52 2023] cpuidle_enter+0x29/0x40
      [Mon Feb 13 08:39:52 2023] cpuidle_idle_call+0x12c/0x1c0
      [Mon Feb 13 08:39:52 2023] do_idle+0x7b/0xe0
      [Mon Feb 13 08:39:52 2023] cpu_startup_entry+0x19/0x20
      [Mon Feb 13 08:39:52 2023] secondary_startup_64_no_verify+0xc3/0xcb
      [Mon Feb 13 08:39:52 2023] --[ end trace dd70479d1bcea003 ]--
      [Mon Feb 13 08:39:52 2023] e1000e 0000:07:00.0 enp7s0: Reset adapter unexpectedly
      [Mon Feb 13 08:39:52 2023] br0: port 1(enp7s0) entered disabled state
      [Mon Feb 13 08:39:56 2023] e1000e 0000:07:00.0 enp7s0: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
      [Mon Feb 13 08:39:56 2023] br0: port 1(enp7s0) entered blocking state
      [Mon Feb 13 08:39:56 2023] br0: port 1(enp7s0) entered forwarding state

            rhn-engineering-jkc Ken Cox
            jira-bugzilla-migration RH Bugzilla Integration
            Ken Cox Ken Cox
            Dipali Patel Dipali Patel
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated: