-
Bug
-
Resolution: Unresolved
-
rhel-storage-management
-
-
We have a couple of CKI checkouts from rhel7.9 official builds failing due to a lack of semaphores during LVM-related testing (storage/lvm/snapper):
https://datawarehouse.cki-project.org/kcidb/tests/16118512
INFO: Creating LVs
INFO: [2025-01-23 15:36:21] Running: 'lvcreate --thinpool test_pool -L 800M vgtest'...
Thin pool volume with chunk size 64.00 KiB can address at most 15.81 TiB of data.
Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.
Unable to resume vgtest-lvol0 (253:3).
Failed to activate new LV.
Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.
Unable to deactivate vgtest-lvol0 (253:3).
Unable to deactivate failed new LV vgtest/lvol0. Manual intervention required.
FAIL: Could not create test_pool - ext4
The failures occurred on:
kvm-05-guest16.lab.eng.rdu2.dc.redhat.com
kvm-04-guest27.lab.eng.rdu2.redhat.com
The failures are always on x86_64 builds; the rt build has not reproduced the failure so far.
I reserved one of these hosts and ran the test on its own, but I could not reproduce the problem either:
==============================================================================================================
Test name: lvm/snapper/snapper_basic.py Status: PASS Elapsed Time: 09m18s
==============================================================================================================
Total - PASS: 1 FAIL: 0 SKIP: 0 WARN: 0 Total Time: 09m18s
==============================================================================================================
[root@kvm-05-guest16 snapper]# uname -a
Linux kvm-05-guest16.lab.eng.rdu2.dc.redhat.com 3.10.0-1160.131.1.el7.x86_64 #1 SMP Thu Jan 23 07:18:40 EST 2025 x86_64 x86_64 x86_64 GNU/Linux
I noted some earlier segfault errors from the openposix_testsuite test:
[ 620.227926] mmap_6-1.run-te[30629]: segfault at 7f542fd6e000 ip 0000000000400b04 sp 00007ffe98a8b830 error 6 in mmap_6-1.run-test[400000+1000]
[ 620.245278] mmap_6-2.run-te[30641]: segfault at 7f5a6b397000 ip 0000000000400bb3 sp 00007ffcdfe84d90 error 4 in mmap_6-2.run-test[400000+2000]
[ 620.261684] mmap_6-3.run-te[30653]: segfault at 7f74193af000 ip 0000000000400ab1 sp 00007ffd0e8918e0 error 6 in mmap_6-3.run-test[400000+1000]
[ 814.042153] pthread_create_[6853]: segfault at 7f3bfac36e68 ip 000000000040211d sp 00007f3bfac36e60 error 6 in pthread_create_1-5.run-test[400000+4000]
[ 814.047771] pthread_create_[6858]: segfault at 7f3bfac36e68 ip 000000000040211d sp 00007f3bfac36e60 error 6 in pthread_create_1-5.run-test[400000+4000]
[ 814.121588] pthread_create_[6934]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000]
[ 814.125511] pthread_create_[6936]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000]
[ 814.129324] pthread_create_[6938]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000]
[ 814.134220] pthread_create_[6940]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000]
[ 814.139313] pthread_create_[6942]: segfault at 7f8ec9067e68 ip 0000000000402096 sp 00007f8ec9067e60 error 6 in pthread_create_3-2.run-test[400000+5000]
I also ran this test before snapper and could reproduce the segfaults, but not the semaphore problem.
As a further triage step, I ran 5 of the preceding tests individually before running the snapper test again, but could not reproduce the failure.
LVM commands can indeed fail when semaphores run short, but I am not sure what is causing the semaphore depletion. It might not be the test itself but something that ran previously, so
I'm assigning this to the CKI team for the moment. Please reassign to a new owner if appropriate.
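To narrow down which earlier test leaves semaphores behind, something like the following could be run between tests. It is only a minimal sketch, not part of the CKI test suite: the function names (parse_sem_limits, parse_sem_arrays, usage_report, new_arrays, snapshot) are hypothetical, and it assumes the usual Linux layout of /proc/sys/kernel/sem (SEMMSL SEMMNS SEMOPM SEMMNI) and of /proc/sysvipc/sem (one header line, then one row per semaphore array with semid in the second column and nsems in the fourth).

```python
from pathlib import Path


def parse_sem_limits(text):
    """Parse /proc/sys/kernel/sem, assumed order: SEMMSL SEMMNS SEMOPM SEMMNI."""
    semmsl, semmns, semopm, semmni = (int(v) for v in text.split())
    return {"SEMMSL": semmsl, "SEMMNS": semmns, "SEMOPM": semopm, "SEMMNI": semmni}


def parse_sem_arrays(text):
    """Parse /proc/sysvipc/sem into {semid: nsems}, skipping the header line."""
    arrays = {}
    for line in text.splitlines()[1:]:
        fields = line.split()
        if len(fields) >= 4:
            # Assumed columns: key, semid, perms, nsems, ...
            arrays[int(fields[1])] = int(fields[3])
    return arrays


def usage_report(limits, arrays):
    """Compare the semaphore arrays in use against the kernel limits."""
    return {
        "arrays_used": len(arrays),
        "arrays_max": limits["SEMMNI"],
        "sems_used": sum(arrays.values()),
        "sems_max": limits["SEMMNS"],
    }


def new_arrays(before, after):
    """Semaphore array ids present after a test but not before it."""
    return sorted(set(after) - set(before))


def snapshot():
    """Read the live limits and semaphore arrays from /proc (Linux only)."""
    limits = parse_sem_limits(Path("/proc/sys/kernel/sem").read_text())
    arrays = parse_sem_arrays(Path("/proc/sysvipc/sem").read_text())
    return limits, arrays
```

Calling snapshot() before and after each test in the checkout and passing the two array maps to new_arrays() would show which test leaves semaphore arrays allocated; usage_report() shows how close the host is to SEMMNI. If the leftover arrays turn out to be device-mapper udev cookies, `dmsetup udevcookies` should list them as well.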