Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Critical
Fix Version/s: 4.19.0
Affects Version/s: 4.19
Component/s: Node / CRI-O
Labels:
- triaged

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
None
Regression:
Yes

Target Backport Versions:
None
Target Version:

4.19.0
Release Blocker:
Approved
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
Done
Release Note Type:
Release Note Not Required
Release Note Text:
N/A

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

    MicroShift fails to start because kubelet cannot talk to crio over a socket because crio is SIGABRT'ed by systemd.

CRIO complains for a minute that it "Failed to get network for name: ovn-kubernetes". Then following is logged:

msg="Will not notify watchdog because CRI-O is unhealthy: health checker failed: runtime status \"NetworkReady\" is invalid: Network plugin returns error: no CNI configu
ration file in /etc/cni/net.d/. Has your network provider started? (reason: NetworkPluginNotReady)" file="watchdog/watchdog.go:64"

systemd[1]: crio.service: Watchdog timeout (limit 1min)!
systemd[1]: crio.service: Killing process 22355 (crio) with signal SIGABRT.

Version-Release number of selected component (if applicable):

crio 1.32

How reproducible:

Everytime with 1.32 in our CI

Steps to Reproduce:

    1. Install MicroShift, install crio 1.32 (from the ocp mirror).
    2. `systemctl start microshift`
    3.

Actual results:

    Starting microshift fails (it eventually starts when everything catches up to, but first start is important in CI).

Expected results:

    crio isn't killed because the CNI is not ready yet. MicroShift starts because kubelet can contact crio.

Additional info:

    Example journalctl (search for SIGABRT): https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/test-platform-results/logs/periodic-ci-openshift-microshift-release-4.19-periodics-e2e-aws-tests-bootc-arm-nightly/1878623192738697216/artifacts/e2e-aws-tests-bootc-arm-nightly/openshift-microshift-e2e-metal-tests/artifacts/scenario-info/el95-src@isolated-net/vms/host1/sos/journal_2025-01-13_02:58:46.log

Here's crio trace log from another machine:
https://drive.google.com/file/d/1sQHFzV_cJOPMBkbItBIocusPtWKSyjp1/view?usp=sharing

links to

RHEA-2024:11038 OpenShift Container Platform 4.19.z bug fix update

Assignee:: Peter Hunt

Reporter:: Patryk Matuszak

QA Contact:: Aditi Sahay

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 11 Start watching this issue

Created:: 2025/01/13 3:56 PM

Updated:: 2025/07/17 1:15 PM

Resolved:: 2025/06/17 4:46 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates