Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-73706

NFS client TLS spurious -EPERM on LIF migrate

Linking RHIVOS CVEs to...Migration: Automation ...Sync from "Extern...XMLWordPrintable

    • kernel-5.14.0-571.el9
    • No
    • Critical
    • rhel-fs
    • 3
    • 5
    • 7
    • False
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • Proposed Exception
    • None

      I continue to see "Permission denied" errors with both NFSv3 and NFSv4.1 mounts:

       

      root@scs000379747:~ #mount |grep /mnt
      10.224.118.192:/data/ on /mnt type nfs (rw,relatime,vers=3,rsize=1048576,wsize=1048576,namlen=255,hard,proto=tcp,timeo=600,retrans=2,sec=sys,xprtsec=tls,mountaddr=10.224.118.192,mountvers=3,mountport=635,mountproto=udp,local_lock=none,addr=10.224.118.192)
      root@scs000379747:~ #cp -v ./1gb /mnt/
      './1gb' -> '/mnt/1gb'
      cp: failed to close '/mnt/1gb': Permission denied
      root@scs000379747:~ #uname -a
      Linux scs000379747 5.14.0-427.44.1.tls_fixes_4.el9_4.x86_64 #1 SMP PREEMPT_DYNAMIC Sat Jan 4 11:09:00 EST 2025 x86_64 x86_64 x86_64 GNU/Linux
      

       

      I'm able to copy other files though:

      root@scs000379747:~ #cp ./runonce.sh /mnt/
      root@scs000379747:~ #

      I'm running the op as root:

      root@scs000379747:~ #id
      uid=0(root) gid=0(root) groups=0(root) context=unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023
      root@scs000379747:~ #

      The test case is simple, to run a copy command in a loop on the client:

      for ((i=0;i<500;i++)) do echo iter $i; cp -v ./1gb /mnt/; done

      On the server, I migrate the lif with a similar loop:

      for ((i=0;i<10000;i++));
      do
        echo iteration $i
        ngsh -c " network interface migrate -vserver vs2 -lif data -destination-node dhairesh-vsim1"
        sleep 3
        ngsh -c "network interface migrate -vserver vs2 -lif data -destination-node dhairesh-vsim2"
      done

      After few iterations, the client gets into this state and copying the file keeps failing with this error. I stopped the LIF-migration script and it still fails.

      An unmount and remount fix the problem.

        1. WithRWSizeAs16k.zip
          1.40 MB
        2. WithDefaultRWSize.zip
          340 kB
        3. ktls-utils-0.11-RHEL_73706.el9_4.x86_64.rpm
          39 kB
        4. dmesgOutput.zip
          49 kB

              bcodding@redhat.com Benjamin Coddington
              dhairesh@netapp.com Dhairesh Oza (Inactive)
              NFS Team NFS Team
              Yongcheng Yang Yongcheng Yang
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

                Created:
                Updated: