-
Bug
-
Resolution: Obsolete
-
Undefined
-
None
-
4.12.z
-
No
-
CNF RAN Sprint 242, CNF RAN Sprint 243, CNF RAN Sprint 244
-
3
-
False
-
-
9/19: telco priority pending
-
Description of problem:
While upgrading 3557 from 4.12.27 to 4.12.29 and precaching all clusters before the upgrade. One cluster failed early during the precaching with the following in the logs # oc --kubeconfig /root/hv-vm/kc/vm00311/kubeconfig logs -n openshift-talo-pre-cache pre-cache-47nlx % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0curl: (7) Failed to connect to kubernetes.default.svc port 443: Connection refusedhighThresholdPercent: diskSize:125293548 used:30470528ERROR: not enough space for precaching It appears that the precaching script failed to complete a curl request because of an unexpected intermittent api outage. https://github.com/openshift-kni/cluster-group-upgrades-operator/blob/release-4.13/pre-cache/check_space#L8 Shouldn't the script retry and or have the job pod retry?
Version-Release number of selected component (if applicable):
ACM - 2.9.0-DOWNSTREAM-2023-08-28-21-42-15 Hub OCP 4.13.10 Deployed SNOs - 4.12.27 TALM - 4.13.0 (Threaded with 5 threads)
How reproducible:
Steps to Reproduce:
1. 2. 3.
Actual results:
Expected results:
Additional info:
- blocks
-
OCPBUGS-19520 Cluster failed to precache and did not retry because of "Failed to connect to kubernetes.default.svc port 443: Connection refused"
- ON_QA
- is cloned by
-
OCPBUGS-19520 Cluster failed to precache and did not retry because of "Failed to connect to kubernetes.default.svc port 443: Connection refused"
- ON_QA
- links to
-
RHEA-2023:112754 OpenShift Container Platform 4.14.0 CNF vRAN extras update
- mentioned on