Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-67472

port_state remains Online after sending FPIN link integrity events

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • rhel-9.6
    • None
    • No
    • None
    • rhel-sst-logical-storage
    • ssg_filesystems_storage_and_HA
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • x86_64
    • None

      What were you trying to do that didn't work?

      On a system with both FC LUNs and NVMe-FC namespaces, I enabled device-mapper-multipath and added the following line to the defaults section of /etc/multipath.conf:

      marginal_pathgroups fpin

      multipathd was then restarted and I began to send link integrity events from the Brocade switch (/fabos/cliexec/ftc test --fpin 020700 -li -link_failure). At that point, I was able to see port_states become Marginal which is what I expected:

       

       

      [root@storageqe-14 ~]# for i in `find /sys -name port_state | fgrep rport`; do echo $i; cat $i; done
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-2/fc_remote_ports/rport-10:0-2/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-0/fc_remote_ports/rport-10:0-0/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-7/fc_remote_ports/rport-10:0-7/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-5/fc_remote_ports/rport-10:0-5/port_state
      Marginal
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-3/fc_remote_ports/rport-10:0-3/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-1/fc_remote_ports/rport-10:0-1/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-8/fc_remote_ports/rport-10:0-8/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-6/fc_remote_ports/rport-10:0-6/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-4/fc_remote_ports/rport-10:0-4/port_state
      Marginal
      

       

       

      I then disabled/enabled the switch port to clear the condition. Furthermore, I removed all the FC LUNs by running the following command:

       

      1. echo 1 > /sys/block/sdc/device/delete

      Now I only have NVMe namespaces monitored by device-mapper-multipath:

       

      [root@storageqe-14 ~]# multipath -ll
      mpatha (uuid.cf79fefe-a141-4cd8-921e-f2c57511718e) dm-8 NVME,NetApp ONTAP Controller                 
      size=50G features='1 queue_if_no_path' hwhandler='0' wp=rw
      |-+- policy='service-time 0' prio=50 status=active
      | `- 8:41408:5:5 nvme8n5 259:19 active ready running
      |-+- policy='service-time 0' prio=10 status=enabled
      | `- 7:41409:5:5 nvme7n5 259:12 active ready running
      |-+- policy='service-time 0' prio=50 status=enabled
      | `- 5:41472:5:5 nvme5n5 259:4  active ready running
      `-+- policy='service-time 0' prio=10 status=enabled
        `- 6:41473:5:5 nvme6n5 259:5  active ready running
      

      I again issue the same FPIN link integrity events from the switch. Although I can see the  link_failure  counters increment, the port_state remains Online:

      1. cat /sys/class/fc_host/host10/statistics/fpin_li_link_failure_count
        0xa

       

       

      oot@storageqe-14 ~]# for i in `find /sys -name port_state | fgrep rport`; do echo $i; cat $i; done
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-2/fc_remote_ports/rport-10:0-2/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-0/fc_remote_ports/rport-10:0-0/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-7/fc_remote_ports/rport-10:0-7/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-5/fc_remote_ports/rport-10:0-5/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-3/fc_remote_ports/rport-10:0-3/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-1/fc_remote_ports/rport-10:0-1/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-8/fc_remote_ports/rport-10:0-8/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-6/fc_remote_ports/rport-10:0-6/port_state
      Online
      /sys/devices/pci0000:40/0000:40:01.1/0000:41:00.0/host10/rport-10:0-4/fc_remote_ports/rport-10:0-4/port_state
      Online
      

       

      Please provide the package NVR for which the bug is seen:

      1. rpm -qa device-mapper-multipath
        device-mapper-multipath-0.8.7-34.el9.x86_64
      1. uname -r
        5.14.0-528.el9.x86_64

        How reproducible is this bug?: Often

        Steps to reproduce

      2. See above 

              rhn-engineering-bmarzins Benjamin Marzinski
              mpatalan Marco Patalano
              Benjamin Marzinski Benjamin Marzinski
              Lin Li Lin Li
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

                Created:
                Updated: