-
Bug
-
Resolution: Done-Errata
-
Normal
-
4.11
Description of problem:
Installations of private clusters (publish: Internal) fails on Azure Stack Hub due to reference the incorrect load balancer when attempting to crate worker nodes.
Version-Release number of selected component (if applicable):
4.11.0-0.nightly-2022-07-05-083948 (and previous nightlies)
How reproducible:
always
Steps to Reproduce:
1. Attempt to install a private cluster on ASH
2. wait until installer times out after removal of bootstrap
3. log in to cluster and observe that only master nodes exist
Actual results:
Incomplete cluster with no worker nodes
Expected results:
cluster installs successfully
Additional info:
core@mghaganproxy:~$ oc get machines -A
NAMESPACE NAME PHASE TYPE REGION ZONE AGE
openshift-machine-api mgahagan220706-ff6dd-master-0 Running Standard_DS4_v2 mtcazs 6h16m
openshift-machine-api mgahagan220706-ff6dd-master-1 Running Standard_DS4_v2 mtcazs 6h16m
openshift-machine-api mgahagan220706-ff6dd-master-2 Running Standard_DS4_v2 mtcazs 6h16m
openshift-machine-api mgahagan220706-ff6dd-worker-mtcazs-29hpn Failed 6h8m
openshift-machine-api mgahagan220706-ff6dd-worker-mtcazs-c6mr4 Failed 6h8m
openshift-machine-api mgahagan220706-ff6dd-worker-mtcazs-nl6h5 Failed 6h8m
openshift-machine-api mgahagan220706-ff6dd-worker-mtcazs-rtgk8 Failed 5h14m
inspecting one of the failed workers with oc describe we see:
Error Message: failed to reconcile machine "mgahagan220706-ff6dd-worker-mtcazs-c6mr4": network.LoadBalancersClient#Get: Failure responding to request: StatusCode=404 – Original Error: autorest/azure: Service returned an error. Status=404 Code="ResourceNotFound" Message="The Resource 'Microsoft.Network/loadBalancers/mgahagan220706-ff6dd' under resource group 'mgahagan220706-ff6dd-rg' was not found. For more details please go to https://aka.ms/ARMResourceNotFoundFix"
Given that this is a private cluster created with publish: Internal the proper load balancer to bind the worker node's nic to should be mgahagan220706-ff6dd-internal
- links to
-
RHEA-2024:11038 OpenShift Container Platform 4.19.z bug fix update