Bug
Resolution: Not a Bug
Critical
Description of problem:
Image-based installations fail to complete because of the error 'no endpoints available for service "image-based-install-webhook"' reported in the ClusterInstance.
Version-Release number of selected component (if applicable):
Hub: OCP 4.19.0-rc.5, ACM 2.13.3, MCE 2.8.2
Spoke: OCP 4.19.0-rc.5
How reproducible: Frequently. Possibly constantly.
Steps to Reproduce:
- Begin image-based install on spoke cluster via GitOps workflow
- ClusterInstance CR is created:
  NAMESPACE   NAME      PAUSED   PROVISIONSTATUS   PROVISIONDETAILS   AGE
  helix82     helix82                                                 10h
- ICI, BMH, ClusterDeployment are not created
- Error in ClusterInstance:
  - apiGroup: extensions.hive.openshift.io/v1alpha1
    kind: ImageClusterInstall
    lastAppliedTime: "2025-06-09T02:45:11Z"
    message: 'failed to create rendered object: Internal error occurred: failed calling
      webhook "imageclusterinstalls.extensions.hive.openshift.io": failed to call
      webhook: Post "https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s":
      no endpoints available for service "image-based-install-webhook"'
Actual results: IBI Fails
Expected results: IBI Succeeds
Additional info:
Relevant siteconfig pod log snippet:
2025-06-09T13:07:15.393Z INFO ClusterInstanceController controller/clusterinstance_controller.go:527 Successfully rendered templates {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15.410Z INFO ClusterInstanceController controller/clusterinstance_controller.go:542 Finished rendering templates {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15.410Z INFO ClusterInstanceController.pruneManifests controller/clusterinstance_controller.go:818 No objects to prune; skipping pruning operation {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15.410Z INFO ClusterInstanceController.validateRenderedManifests controller/clusterinstance_controller.go:722 Executing a dry-run validation on the rendered manifests {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15.610Z INFO ClusterInstanceController.validateRenderedManifests controller/clusterinstance_controller.go:733 , err: failed to create rendered object: Internal error occurred: failed calling webhook "imageclusterinstalls.extensions.hive.openshift.io": failed to call webhook: Post "https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s": no endpoints available for service "image-based-install-webhook" {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15.631Z INFO ClusterInstanceController.validateRenderedManifests controller/clusterinstance_controller.go:756 Finished executing a dry-run validation on the rendered manifests {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15.631Z INFO ClusterInstanceController controller/clusterinstance_controller.go:149 Finished reconciling ClusterInstance {"name": "helix82", "namespace": "helix82", "version": "3545675"}
2025-06-09T13:07:15Z INFO Warning: Reconciler returned both a non-zero result and a non-nil error. The result will always be ignored if the error is non-nil and the non-nil error causes reqeueuing with exponential backoff. For more details, see: https://pkg.go.dev/sigs.k8s.io/controller-runtime/pkg/reconcile#Reconciler {"controller": "clusterinstance", "controllerGroup": "siteconfig.open-cluster-management.io", "controllerKind": "ClusterInstance", "ClusterInstance": {"name":"helix82","namespace":"helix82"}, "namespace": "helix82", "name": "helix82", "reconcileID": "5ed4489c-5643-4742-a69b-07ad21e7e74c"}
2025-06-09T13:07:15Z ERROR Reconciler error {"controller": "clusterinstance", "controllerGroup": "siteconfig.open-cluster-management.io", "controllerKind": "ClusterInstance", "ClusterInstance": {"name":"helix82","namespace":"helix82"}, "namespace": "helix82", "name": "helix82", "reconcileID": "5ed4489c-5643-4742-a69b-07ad21e7e74c", "error": "failed to create rendered object: Internal error occurred: failed calling webhook \"imageclusterinstalls.extensions.hive.openshift.io\": failed to call webhook: Post \"https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s\": no endpoints available for service \"image-based-install-webhook\"", "errorCauses": [{"error": "failed to create rendered object: Internal error occurred: failed calling webhook \"imageclusterinstalls.extensions.hive.openshift.io\": failed to call webhook: Post \"https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s\": no endpoints available for service \"image-based-install-webhook\""}]}
$ oc get services -n rhacm webhook-clusterinstances-siteconfig-open-cluster-management-io
NAME                                                             TYPE        CLUSTER-IP      EXTERNAL-IP   PORT(S)   AGE
webhook-clusterinstances-siteconfig-open-cluster-management-io   ClusterIP   172.30.132.59   <none>        443/TCP   3d20h
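The webhook URL in the error names a different Service from the one listed above, so a possible follow-up check is sketched here. It is only a sketch: it assumes the URL host follows the usual <service>.<namespace>.svc form, the variable names are illustrative, and it prints the oc commands rather than running them. No output from those commands was captured for this report.

```shell
#!/bin/sh
# Webhook URL copied verbatim from the ClusterInstance error message.
url='https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s'

# Assumption: the host has the form <service>.<namespace>.svc[:port].
host=${url#https://}   # drop the scheme
host=${host%%/*}       # drop the path and query string
host=${host%%:*}       # drop the port
svc=${host%%.*}        # first DNS label: Service name
rest=${host#*.}        # everything after the first label
ns=${rest%%.*}         # second DNS label: namespace

# Print (do not run) the checks for whether the Service has ready endpoints.
echo "oc get endpoints $svc -n $ns"
echo "oc get pods -n $ns"
```

If the endpoints list for that Service is empty, the pods behind it are either missing or not Ready, which would match the "no endpoints available" message.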