Bug
Resolution: Unresolved
Normal
4.22
False
Moderate
Yes
Description of problem:
When config drift occurs in a systemd unit or in a dropin file and we create the /run/machine-config-daemon-force file to re-apply the configuration, the config drift is no longer fixed.
Version-Release number of selected component (if applicable):
4.22
How reproducible:
Always
Steps to Reproduce:
1. Create a MC to deploy a dropin file
kind: List
apiVersion: v1
metadata: {}
items:
- apiVersion: machineconfiguration.openshift.io/v1
  kind: MachineConfig
  metadata:
    labels:
      machineconfiguration.openshift.io/role: worker
    name: drifted-dropins-test
  spec:
    config:
      ignition:
        version: 3.5.0
      passwd:
        users: []
      storage:
        files: []
      systemd:
        units:
        - dropins:
          - contents: '[Service]

              Environment="FAKE_OPTS=fake-value"'
            name: 10-chrony-drop-test.conf
          enabled: true
          name: chronyd.service
    extensions: []
    kernelArguments: []
    osImageURL: ''
2. "oc debug" a node and manually modify the 10-chrony-drop-test.conf file
3. The MCP will become degraded, reporting the config drift
4. "oc debug" into the node again and create the /run/machine-config-daemon-force file to re-apply the config
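Steps 2 to 4 can be sketched as the script below. This is only an illustration, not the exact commands used when the bug was found: the node name `worker-0` is hypothetical, and the on-disk path of the dropin is an assumption based on where systemd dropins for `chronyd.service` normally live. By default the script only prints the commands; set `DRY_RUN=0` to actually run them against a cluster.

```shell
#!/bin/sh
# Sketch of steps 2-4. NODE and DROPIN are assumptions; adjust them to the cluster.
NODE="${NODE:-worker-0}"   # hypothetical node name
DROPIN="${DROPIN:-/etc/systemd/system/chronyd.service.d/10-chrony-drop-test.conf}"  # assumed path
DRY_RUN="${DRY_RUN:-1}"    # 1 = only print the commands, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Step 2: manually modify the dropin file on the node to cause the drift.
drift_dropin() {
    run oc debug "node/$NODE" -- chroot /host sh -c "echo '# manual change' >> $DROPIN"
}

# Step 3: look at the worker pool, which should report the degradation.
show_pool() {
    run oc get mcp worker
}

# Step 4: create the force file so that the config is re-applied.
force_reapply() {
    run oc debug "node/$NODE" -- chroot /host touch /run/machine-config-daemon-force
}

drift_dropin
show_pool
force_reapply
```

The dry-run default is a deliberate choice so that the sketch can be read and checked without touching a node.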
Actual results:
The file causing the config drift degradation is not re-applied, and the degradation is not fixed.
Expected results:
Once we create the /run/machine-config-daemon-force file and the configuration is re-applied, the config drift degradation should be fixed.
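One possible way to check the expected result from the command line is to wait for the pool to stop reporting the Degraded condition. The sketch below assumes the `worker` pool from the MachineConfig above; the 10 minute timeout is an arbitrary value, not something taken from the test cases. As before, it only prints the command unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch: wait until the pool is no longer degraded after the forced re-apply.
POOL="${POOL:-worker}"      # pool targeted by the MachineConfig role label
TIMEOUT="${TIMEOUT:-10m}"   # arbitrary timeout, chosen for illustration only
DRY_RUN="${DRY_RUN:-1}"     # 1 = only print the command, 0 = execute it

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Succeeds only if the Degraded condition becomes False before the timeout.
wait_not_degraded() {
    run oc wait "mcp/$POOL" --for=condition=Degraded=False --timeout="$TIMEOUT"
}

wait_not_degraded
```

With this bug present, the wait would be expected to time out, since the drifted file is not re-applied.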
Additional info:
It impacts these test cases:
- "[sig-mco] MCO config drift Author:sregidor-Longduration-NonPreRelease-High-47008-Config Drift. Dropin file. [Serial]"
- "[sig-mco] MCO config drift Author:sregidor-Longduration-NonPreRelease-High-47009-Config Drift. New Service Unit. [Serial]"
This issue can be verified by running them and checking that they pass.
Likely related to: https://github.com/openshift/machine-config-operator/pull/5582
- is caused by: OCPBUGS-58023 machine-config-daemon is kill while updating the system-units on the node, leading to disabled system-units after reboot. (Verified)
- is related to: MCO-1954 Investigate what improvements can be made to the config drift monitor (To Do)