Red Hat Advanced Cluster Management / ACM-21311

IBI installation fails with error: 'no endpoints available for service "image-based-install-webhook"'


      Description of problem:

      Image-based installations fail to complete. The ClusterInstance reports the error 'no endpoints available for service "image-based-install-webhook"'.

      Version-Release number of selected component (if applicable):

      Hub: OCP 4.19.0-rc.5, ACM 2.13.3, MCE 2.8.2 
      Spoke: OCP 4.19.0-rc.5

      How reproducible: Frequently, possibly always.

      Steps to Reproduce:

      1. Begin image-based install on spoke cluster via GitOps workflow
      2. The ClusterInstance CR is created, but its provision status stays empty:
        NAMESPACE   NAME      PAUSED   PROVISIONSTATUS   PROVISIONDETAILS   AGE
        helix82     helix82                                                 10h
      3. The ImageClusterInstall (ICI), BareMetalHost (BMH), and ClusterDeployment resources are not created
      4. Error in ClusterInstance:
          - apiGroup: extensions.hive.openshift.io/v1alpha1
            kind: ImageClusterInstall
            lastAppliedTime: "2025-06-09T02:45:11Z"
            message: 'failed to create rendered object: Internal error occurred: failed calling
              webhook "imageclusterinstalls.extensions.hive.openshift.io": failed to call
              webhook: Post "https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s":
              no endpoints available for service "image-based-install-webhook"'

      Actual results: IBI Fails

      Expected results: IBI Succeeds

      Additional info:

      Relevant siteconfig pod log snippet

      2025-06-09T13:07:15.393Z    INFO    ClusterInstanceController    controller/clusterinstance_controller.go:527    Successfully rendered templates    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15.410Z    INFO    ClusterInstanceController    controller/clusterinstance_controller.go:542    Finished rendering templates    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15.410Z    INFO    ClusterInstanceController.pruneManifests    controller/clusterinstance_controller.go:818    No objects to prune; skipping pruning operation    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15.410Z    INFO    ClusterInstanceController.validateRenderedManifests    controller/clusterinstance_controller.go:722    Executing a dry-run validation on the rendered manifests    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15.610Z    INFO    ClusterInstanceController.validateRenderedManifests    controller/clusterinstance_controller.go:733    , err: failed to create rendered object: Internal error occurred: failed calling webhook "imageclusterinstalls.extensions.hive.openshift.io": failed to call webhook: Post "https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s": no endpoints available for service "image-based-install-webhook"    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15.631Z    INFO    ClusterInstanceController.validateRenderedManifests    controller/clusterinstance_controller.go:756    Finished executing a dry-run validation on the rendered manifests    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15.631Z    INFO    ClusterInstanceController    controller/clusterinstance_controller.go:149    Finished reconciling ClusterInstance    {"name": "helix82", "namespace": "helix82", "version": "3545675"}
      2025-06-09T13:07:15Z    INFO    Warning: Reconciler returned both a non-zero result and a non-nil error. The result will always be ignored if the error is non-nil and the non-nil error causes reqeueuing with exponential backoff. For more details, see: https://pkg.go.dev/sigs.k8s.io/controller-runtime/pkg/reconcile#Reconciler    {"controller": "clusterinstance", "controllerGroup": "siteconfig.open-cluster-management.io", "controllerKind": "ClusterInstance", "ClusterInstance": {"name":"helix82","namespace":"helix82"}, "namespace": "helix82", "name": "helix82", "reconcileID": "5ed4489c-5643-4742-a69b-07ad21e7e74c"}
      2025-06-09T13:07:15Z    ERROR    Reconciler error    {"controller": "clusterinstance", "controllerGroup": "siteconfig.open-cluster-management.io", "controllerKind": "ClusterInstance", "ClusterInstance": {"name":"helix82","namespace":"helix82"}, "namespace": "helix82", "name": "helix82", "reconcileID": "5ed4489c-5643-4742-a69b-07ad21e7e74c", "error": "failed to create rendered object: Internal error occurred: failed calling webhook \"imageclusterinstalls.extensions.hive.openshift.io\": failed to call webhook: Post \"https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s\": no endpoints available for service \"image-based-install-webhook\"", "errorCauses": [{"error": "failed to create rendered object: Internal error occurred: failed calling webhook \"imageclusterinstalls.extensions.hive.openshift.io\": failed to call webhook: Post \"https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s\": no endpoints available for service \"image-based-install-webhook\""}]}
       
      $ oc get services -n rhacm webhook-clusterinstances-siteconfig-open-cluster-management-io 
      NAME                                                             TYPE        CLUSTER-IP      EXTERNAL-IP   PORT(S)   AGE
      webhook-clusterinstances-siteconfig-open-cluster-management-io   ClusterIP   172.30.132.59   <none>        443/TCP   3d20h
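
Note that the service listed above is the siteconfig ClusterInstance webhook in the rhacm namespace, whereas the error names a different service, image-based-install-webhook, under multicluster-engine.svc. As a possible next diagnostic step, the sketch below pulls the service name and namespace out of the webhook URL in the error message and prints the `oc` commands that would inspect that service, its endpoints, and the pods in its namespace. The helper name `webhook_target` is hypothetical, and the script only prints the commands rather than running them, so it makes no assumption about cluster access.

```shell
#!/bin/sh
# Hypothetical helper: print "<service> <namespace>" for the first
# https://<service>.<namespace>.svc URL found in a webhook error message.
webhook_target() {
  printf '%s\n' "$1" |
    sed -n 's|.*https://\([^./]*\)\.\([^./]*\)\.svc[:/].*|\1 \2|p'
}

# Error message copied from the ClusterInstance status in this report.
msg='failed to call webhook: Post "https://image-based-install-webhook.multicluster-engine.svc:443/validate-extensions-hive-openshift-io-v1alpha1-imageclusterinstall?timeout=10s": no endpoints available for service "image-based-install-webhook"'

# Split the helper output into service name and namespace.
set -- $(webhook_target "$msg")
svc=$1
ns=$2

# Print the commands to run against the hub cluster.
echo "oc get service -n $ns $svc"
echo "oc get endpoints -n $ns $svc"
echo "oc get pods -n $ns -o wide"
```

An empty endpoints list for that service would be consistent with the error; the pod listing can then show whether a pod that should back the webhook is missing or not ready.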
       

              josclark@redhat.com Joshua Clark