Loading...

Linking RHIVOS CVEs to...

Migration: Automation ...

Sync from "Extern...

XML

Word

Printable

Type: Bug
Resolution: Won't Do
Priority: Undefined
Fix Version/s: None
Affects Version/s: rhel-8.6.0
Component/s: nvme-cli
Labels:
- MigratedToJIRA

Regression:
None
Severity:
Important
AssignedTeam:
rhel-storage-io-2
Sub-System Group:

ssg_filesystems_storage_and_HA

Story Points:
None
Blocked:
False
Ready:
False
Blocked Reason:

Hide

None

Show
None
Product Documentation Required:
None
Sprint:
None

Preliminary Testing:
None
Test Coverage:
None

Release Note Type:
If docs needed, set a value

Experience:
Architecture:

x86_64
Bugzilla Bug:
RHBZ: 2192922

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Planning:
None
Internal Target Milestone numeric:
57,005

Description of problem:
After disrupting the initiator link for two minutes, the NVME controllers and namespaces are not getting recovered on a timely basis after the lpfc driver successfully recovers the FC logins and successfully reregisters the remote ports. The recovery of NVME controllers and namespaces can take between 9 minutes to 61 minutes.

Version-Release number of selected component (if applicable):
Issue is seen on:

Linux dhcp-10-231-139-179 4.18.0-372.9.1.el8.x86_64 #1 SMP Fri Apr 15 22:12:19 EDT 2022 x86_64 x86_64 x86_64 GNU/Linux

[root@dhcp-10-231-139-179 ~]# cat /etc/os-release
NAME="Red Hat Enterprise Linux"
VERSION="8.6 (Ootpa)"
ID="rhel"
ID_LIKE="fedora"
VERSION_ID="8.6"
PLATFORM_ID="platform:el8"
PRETTY_NAME="Red Hat Enterprise Linux 8.6 (Ootpa)"

How reproducible:
Always. Time to reproduction is about 10 minutes

Steps to Reproduce:
1. Map few SCSI and NVMe NS from a target to both the HBA ports. Enable
multipath and NVMe ANA to detect the multipath devices. The IBM9500 target is in use for ECD, but any target capable of FCP and NVME will do.

Zone config:

Zone1: HBA Port0 + SCSI Tgt Port0 + NVMe Tgt Port0
Zone2: HBA Port1 + SCSI Tgt Port0 + NVMe Tgt Port0

[root@dhcp-10-231-133-36 ~]# nvme list-subsys
nvme-subsys0 - NQN=nqn.1986-03.com.ibm:nvme:2145.00000204E0607C1E
\
+- nvme0 fc traddr=nn-0x5005076813003e0f:pn-0x50050768131b3e0f
host_traddr=nn-0x200000109bf67eba:pn-0x100000109bf67eba live
+- nvme1 fc traddr=nn-0x5005076813003e0f:pn-0x50050768131b3e0f
host_traddr=nn-0x200000109bf67ebb:pn-0x100000109bf67ebb live

2. Do a port shut from Cisco64G switch. Enable the port after a sleep of 120
secs. Again, this is not a switch issue so any vendor should be OK.

3. SCSI luns got detected, but NVMe controllers did not detected even after
waiting for more than ~10 minutes.

Actual results:
NVME paths do not show up for long periods of time.

Expected results:
SCSI and NVME pathing should recover in a reasonable amount of time.

Additional info:

external trackers

Red Hat Issue Tracker RHELPLAN-156312

Assignee:: Maurizio Lombardi

Reporter:: Paul Ely

Contributing Groups:: Broadcom ECD Confidential Group

Developer:: Maurizio Lombardi

QA Contact:: Yi Zhang

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: 2023/09/23 1:01 PM

Updated:: 2025/10/07 1:40 AM

Resolved:: 2024/02/22 4:48 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates