-
Bug
-
Resolution: Unresolved
-
Normal
-
None
-
rhel-8.6.0.z
-
sst_cs_plumbers
-
ssg_core_services
-
8
-
False
Description of problem:
An OpenShift node becomes unstable and pods stay in the "ContainerCreating" state. The affected containers are associated with this error:

Jan 12 13:35:39 master1.local dbus-daemon[2412]: [system] Failed to activate service 'org.freedesktop.systemd1': timed out (service_start_timeout=25000ms)

systemd becomes unresponsive, and the problem appears to have started with this segfault in PID 1:

Jan 12 03:32:22 master1.local kernel: systemd[1]: segfault at 38 ip 00007fdab5443952 sp 00007fff90c2ee18 error 4 in libmount.so.1.1.0[7fdab5433000+56000]
Jan 12 03:32:22 master1.local kernel: Code: 31 c0 c3 0f 1f 44 00 00 f3 0f 1e fa 48 85 ff 74 0a 48 89 b7 c8 00 00 00 31 c0 c3 b8 ea ff ff ff c3 0f 1f 80 00 00 00 00 f3 0f <1e> fa 31 c0 48 85 ff 74 15 48 83 7f 38 00 75 0e 48 8b 47 30 c3 66
Jan 12 03:32:22 master1.local kernel: Core dump to |/usr/bin/getcoreinfo.sh systemd 11 28757 1705030342 master1.local pipe failed
Jan 12 03:32:22 master1.local systemd[1]: Caught <SEGV>, core dump failed (child 28757, code=killed, status=11/SEGV).
Jan 12 03:32:22 master1.local systemd[1]: Freezing execution.
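For triage, the kernel segfault line above can be decoded mechanically. The following is a minimal sketch, not part of the original report: the function name `decode_segfault` and the returned keys are hypothetical, and it assumes the standard x86 page-fault error-code bits (bit 0 = page present, bit 1 = write access, bit 2 = user mode) and that the kernel prints the error code in hexadecimal.

```python
import re

# Matches the x86 kernel "segfault at ..." message format seen in the log.
SEGFAULT_RE = re.compile(
    r"segfault at (?P<addr>[0-9a-f]+) ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) "
    r"error (?P<err>[0-9a-f]+) in (?P<obj>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)

# Log line copied from the description of this report.
LINE = (
    "Jan 12 03:32:22 master1.local kernel: systemd[1]: segfault at 38 "
    "ip 00007fdab5443952 sp 00007fff90c2ee18 error 4 in "
    "libmount.so.1.1.0[7fdab5433000+56000]"
)


def decode_segfault(line):
    """Extract the faulting address, mapped object, and offset from a segfault line."""
    m = SEGFAULT_RE.search(line)
    if m is None:
        raise ValueError("not a kernel segfault line")
    ip = int(m["ip"], 16)
    base = int(m["base"], 16)
    err = int(m["err"], 16)
    return {
        "fault_address": int(m["addr"], 16),
        "object": m["obj"],
        # Instruction pointer relative to the start of the mapping.
        "offset": ip - base,
        # Assumed x86 page-fault error-code bits.
        "page_present": bool(err & 0x1),
        "write": bool(err & 0x2),
        "user_mode": bool(err & 0x4),
    }


if __name__ == "__main__":
    info = decode_segfault(LINE)
    print(info["object"], hex(info["offset"]), hex(info["fault_address"]))
```

Under these assumptions, the line decodes to a user-mode read of an unmapped page at address 0x38, with the instruction pointer at offset 0x10952 into the libmount.so.1.1.0 mapping. A fault address that low is typical of a field access through a NULL pointer, although the log alone does not confirm that. With matching debuginfo installed, the offset may help locate the faulting function, provided the mapping shown corresponds to the start of the library file.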
Version-Release number of selected component (if applicable):
OpenShift 4.12.10
RHCOS 412.86.202303241612-0 (based on RHEL 8.6)
Kernel 4.18.0-372.49.1.el8_6.x86_64
systemd 239-58.el8_6.10.x86_64
How reproducible:
No known reproducer. The issue has occurred more than once on the same node.
Steps to Reproduce:
1. 2. 3.
Actual results:
systemd (PID 1) freezes execution after the segfault, and the node has to be rebooted to recover.
Expected results:
systemd does not crash, and the node stays responsive.
Additional info: