Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Major
Fix Version/s: compliance-operator-1.10.0
Affects Version/s: None
Component/s: Compliance Operator
Labels:
- triaged

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Intelligence Requested:
Market:

Severity:
Moderate

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

PX Impact Score:

Description of problem:

After upgrading to OpenShift Container Platform 4.18.15, the customer noticed that on all Nodes, the "chrony-wait.service" is in status failed:

$ sudo systemctl status chrony-wait.service 
× chrony-wait.service - Wait for chrony to synchronize system clock
     Loaded: loaded (/usr/lib/systemd/system/chrony-wait.service; disabled; preset: disabled)
     Active: failed (Result: timeout) since Wed 2025-07-09 12:27:39 UTC; 42min ago
       Docs: man:chronyc(1)
   Main PID: 1434 (code=exited, status=1/FAILURE)
        CPU: 113ms

Jul 09 12:24:39 xxx-01-worker-az1-dp6ck systemd[1]: Starting Wait for chrony to synchronize system clock...
Jul 09 12:27:39 xxx-01-worker-az1-dp6ck systemd[1]: chrony-wait.service: start operation timed out. Terminating.
Jul 09 12:27:39 xxx-01-worker-az1-dp6ck systemd[1]: chrony-wait.service: Main process exited, code=exited, status=1/FAILURE
Jul 09 12:27:39 xxx-01-worker-az1-dp6ck systemd[1]: chrony-wait.service: Failed with result 'timeout'.
Jul 09 12:27:39 xxx-01-worker-az1-dp6ck systemd[1]: Failed to start Wait for chrony to synchronize system clock.

The "chronyd.service" is working as expected. In ~~OCPBUGS-59281~~ we then discovered that the "rhcos4-moderate" policy recommends setting the following in "/etc/chrony.conf":

# Set chronyd as client-only.
port 0

# Disable chronyc from the network
cmdport 0

This is consistent with the following rules:

However, this setting leads to the "chrony-wait.service" timing out with the following error messages:

# /usr/bin/chronyc -h 127.0.0.1,::1 waitsync 0 0.1 0.0 1
506 Cannot talk to daemon
506 Cannot talk to daemon
[..]

We observe this issue on all clusters that were upgraded to OpenShift Container Platform 4.18.15.

Version-Release number of selected component (if applicable):

OpenShift Container Platform 4.18.15

How reproducible:

Always

Steps to Reproduce:

1. Install a cluster with OpenShift Container Platform 4.18.15
2. Install the Compliance Operator and apply the "rhcos4-moderate" profile, remediate the "chrony" findings mentioned above
3. Restart the OpenShift Nodes
4. Log into an OpenShift Node using SSH
5. Observe that the login message already shows there is a failed service ("chrony-wait.service")
6. Execute "sudo systemctl status chrony-wait.service"

Actual results:

The service shows: "chrony-wait.service: Failed with result 'timeout'." due to the remediation being applied

Expected results:

With the profile "rhcos4-moderate" applied, there are no failed services.
The chrony-wait service finishes as expected.

Additional info:

Findings in ~~OCPBUGS-59281~~
sosreport available in attached Support Case
must-gather available in attached Support Case

relates to

OCPBUGS-59281 "chrony-wait.service" fails on all Nodes with "506 Cannot talk to daemon"

Closed

Assignee:: Unassigned

Reporter:: Simon Krenger

QA Contact:: Xiaojie Yuan

Product Manager:: Maria Simon Marcos

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2025/07/17 11:03 AM

Updated:: 2026/02/12 2:20 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates