Bug
Resolution: Unresolved
Undefined
4.16.z, 4.18.0, 4.19.0, 4.20.0
Quality / Stability / Reliability
False
Important
Rejected
CNF Compute Sprint 260, CNF Compute Sprint 261, CNF Compute Sprint 262, CNF Compute Sprint 263, CNF Compute Sprint 264, CNF Compute Sprint 265, CNF Compute Sprint 266, CNF Compute Sprint 267, CNF Compute Sprint 268, CNF Compute Sprint 269, CNF Compute Sprint 270, CNF Compute Sprint 271, CNF Compute Sprint 272, CNF Compute Sprint 273, CNF Compute Sprint 274, CNF Compute Sprint 275, CNF Compute Sprint 276, CNF Compute Sprint 277
18
Done
Known Issue
Description of problem:
On a SNO node with the RAN profile enabled (many SriovNetworks), the tuned profile became degraded after a node reboot with the following error:
Message: TuneD daemon issued one or more error message(s) during profile application. TuneD stderr: ERROR tuned.utils.commands: Executing 'ethtool -l ens2f0v1' error: netlink error: no device matches name (offset 24)
Version-Release number of selected component (if applicable):
4.16.10, and possibly other 4.16 versions
How reproducible:
This looks like a race condition: the more PFs/VFs there are in the SriovNetwork configuration, the more often the issue occurs.
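The race described above can be illustrated with a small polling helper. This is only a sketch of the general "wait for the device before touching it" idea, not TuneD's actual code or the fix tracked in the linked issues. The function name `wait_for_netdev`, its parameters, and the use of `/sys/class/net` as the place to look for the interface are assumptions made for illustration.

```python
import os
import time


def wait_for_netdev(name, timeout=5.0, interval=0.1, sysfs_root="/sys/class/net"):
    """Poll until a network interface entry appears, or the timeout expires.

    Hypothetical helper: returns True if <sysfs_root>/<name> showed up in time,
    False otherwise. A caller would run 'ethtool -l <name>' only on True.
    """
    deadline = time.monotonic() + timeout
    path = os.path.join(sysfs_root, name)
    while True:
        if os.path.exists(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

A caller that gets `False` back could skip the device or retry later instead of reporting an error, which is the behaviour the expected results ask for.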
Steps to Reproduce:
1. Install a SNO cluster with the RAN profile applied and userLevelNetworking enabled in the PerformanceProfile.
2. Make sure many SriovNetworkNodePolicy objects are created on the cluster.
3. Reboot the cluster and check the profile: oc get profile -A
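The check in step 3 can be scripted when the reproduction has to be repeated many times. The sketch below assumes the Profile objects returned by `oc get profile -A -o json` carry a standard `status.conditions` list with a `Degraded` condition type. That shape is an assumption here and should be confirmed against the actual output on the cluster. The function names are hypothetical.

```python
import json
import subprocess


def degraded_profiles(profile_list):
    """Return (namespace, name, message) for each Profile whose Degraded condition is True.

    Assumes each item has status.conditions entries with 'type', 'status', 'message'.
    """
    found = []
    for item in profile_list.get("items", []):
        meta = item.get("metadata", {})
        for cond in item.get("status", {}).get("conditions", []):
            if cond.get("type") == "Degraded" and cond.get("status") == "True":
                found.append((meta.get("namespace"), meta.get("name"), cond.get("message", "")))
    return found


def fetch_profiles():
    """Run 'oc get profile -A -o json' and parse the result (needs a logged-in oc)."""
    out = subprocess.run(
        ["oc", "get", "profile", "-A", "-o", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)
```

Calling `degraded_profiles(fetch_profiles())` after each reboot gives a list that is empty when no profile is degraded.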
Actual results:
Sometimes the profile becomes degraded. Running 'oc describe profile -A' then shows an error like:
Message: TuneD daemon issued one or more error message(s) during profile application. TuneD stderr: ERROR tuned.utils.commands: Executing 'ethtool -l ens2f0v1' error: netlink error: no device matches name (offset 24)
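When triaging, it helps to know which interfaces TuneD failed on so they can be compared against the VFs created by the SR-IOV policies. The following is a minimal sketch that extracts the device names from the message text shown above. The regular expression is written against that one sample message, and the function name `missing_devices` is hypothetical.

```python
import re

# Matches the fragment seen in the report:
#   Executing 'ethtool -l ens2f0v1' error: netlink error: no device matches name
_PATTERN = re.compile(
    r"Executing '(?P<cmd>[^']+)' error: netlink error: no device matches name"
)


def missing_devices(stderr_text):
    """Return interface names from ethtool commands that failed with 'no device matches name'."""
    devices = []
    for match in _PATTERN.finditer(stderr_text):
        argv = match.group("cmd").split()
        # The interface name is the last argument of the ethtool command line.
        if argv and argv[0] == "ethtool" and argv[-1] not in devices:
            devices.append(argv[-1])
    return devices
```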
Expected results:
The profile should not become degraded after a reboot.
Additional info:
Restarting the tuned pod clears the issue. The SR-IOV CRs needed to reproduce the issue are attached: https://drive.google.com/file/d/1qIjF-fXJeBcu_esp8P-kT9RQ6OJ6Zp_i/view?usp=drive_link
- depends on
FDP-1399 Please add option which will wait for udev settle during startup (Closed)
FDP-1425 pyudev traceback when sync polling for the events (Closed)
- is caused by
RHEL-60906 tuned: Executing 'ethtool -l eth0' error: netlink error: no device matches name (In Progress)
- is related to
OCPBUGS-56442 OCP 4.18+ | Node Tuning Operator is marked as degraded during IPI wait-for-install process (Verified)
RHEL-88238 Please add option which will wait for udev settle during startup (Release Pending)