Uploaded image for project: 'RHEL Testing'
  1. RHEL Testing
  2. RHELTEST-720

storage/lvm/snapper: lvm commands failing due to lack of semaphores

XMLWordPrintable

    • False
    • Hide

      None

      Show
      None
    • False
    • None
    • rhel-storage-management

      We have a couple of CKI checkouts from rhel7.9 official builds failing due to lack for semaphores on LVM related testing (storage/lvm/snapper):

      https://datawarehouse.cki-project.org/kcidb/tests/16118512

      INFO: Creating LVs
      INFO: [2025-01-23 15:36:21] Running: 'lvcreate --thinpool test_pool -L 800M vgtest'...
        Thin pool volume with chunk size 64.00 KiB can address at most 15.81 TiB of data.
        Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.
        Unable to resume vgtest-lvol0 (253:3).
        Failed to activate new LV.
        Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.
        Unable to deactivate vgtest-lvol0 (253:3).
        Unable to deactivate failed new LV vgtest/lvol0. Manual intervention required.
      FAIL: Could not create test_pool - ext4

      Failed ocurred on :

      kvm-05-guest16.lab.eng.rdu2.dc.redhat.com

      kvm-04-guest27.lab.eng.rdu2.redhat.com

      always on x86_64 builds, rt build cannot replicate the failure so far.

      I've reserved and ran individually the test on one of these hosts and i cannot reproduce the problem either :

      ==============================================================================================================
         Test name: lvm/snapper/snapper_basic.py Status: PASS       Elapsed Time: 09m18s
      ==============================================================================================================
      Total - PASS: 1          FAIL: 0          SKIP: 0          WARN: 0                       Total Time: 09m18s
      ==============================================================================================================
      [root@kvm-05-guest16 snapper]# uname -a
      Linux kvm-05-guest16.lab.eng.rdu2.dc.redhat.com 3.10.0-1160.131.1.el7.x86_64 #1 SMP Thu Jan 23 07:18:40 EST 2025 x86_64 x86_64 x86_64 GNU/Linux 

      Noted there were some previous segfault error from openposix_testsuite test :

      [  620.227926] mmap_6-1.run-te[30629]: segfault at 7f542fd6e000 ip 0000000000400b04 sp 00007ffe98a8b830 error 6 in mmap_6-1.run-test[400000+1000] 
      [  620.245278] mmap_6-2.run-te[30641]: segfault at 7f5a6b397000 ip 0000000000400bb3 sp 00007ffcdfe84d90 error 4 in mmap_6-2.run-test[400000+2000] 
      [  620.261684] mmap_6-3.run-te[30653]: segfault at 7f74193af000 ip 0000000000400ab1 sp 00007ffd0e8918e0 error 6 in mmap_6-3.run-test[400000+1000] 
      [  814.042153] pthread_create_[6853]: segfault at 7f3bfac36e68 ip 000000000040211d sp 00007f3bfac36e60 error 6 in pthread_create_1-5.run-test[400000+4000] 
      [  814.047771] pthread_create_[6858]: segfault at 7f3bfac36e68 ip 000000000040211d sp 00007f3bfac36e60 error 6 in pthread_create_1-5.run-test[400000+4000] 
      [  814.121588] pthread_create_[6934]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000] 
      [  814.125511] pthread_create_[6936]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000] 
      [  814.129324] pthread_create_[6938]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000] 
      [  814.134220] pthread_create_[6940]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000] 
      [  814.139313] pthread_create_[6942]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000] 

      Also ran this test before snapper and i could replicate the segfault failure but not the semaphore problem.

      As another triage testing, i ran 5 of the previous testing individually, before running again snapper test, but could not replicate the failure.

      lvm commands may indeed fail if semaphore shortage, but not sure what's causing semaphore depletion, it might not be the test itself but something that rans previously, hence

      im assigning this to CKI team for the moment, please reassign to a new owner if appropiate

              rhn-support-fsuba Filip Suba
              rhn-support-rbergant Roberto Bergantinos Corpas
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: