Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-21045

Editing or deleting the ClusterInstance CR for the previously deployed cluster failed with error about missing webhook, but the webhook exists after upgrade ACM 2.12 to 2.13 with site-config enabled

XMLWordPrintable

    • 3
    • False
    • Hide

      None

      Show
      None
    • False
    • Installer Sprint 2025-59, Installer Sprint 2025-60
    • Important
    • None

      Description of problem:

      Upgrade of ACM from 2.12 to 2.13.2 with SiteConfig operator enabled. A cluster was deployed and managed under 2.12. Following the upgrade:

      • The SiteConfig operator pod in open-cluster-management namespace was not recreated (age showed 12d when all other pods were ~3h)
      • Editing or deleting the ClusterInstance CR for the previously deployed cluster failed with error about missing webhook, but the webhook exists:
        $ oc delete clusterinstance -n cnfdf02 cnfdf02
        Error from server (InternalError): Internal error occurred: failed calling webhook "clusterinstances.siteconfig.open-cluster-management.io": failed to call webhook: Post "https://webhook-clusterinstances-siteconfig-open-cluster-management-io.open-cluster-management.svc:443/validate-siteconfig-open-cluster-management-io-v1alpha1-clusterinstance?timeout=10s": no endpoints available for service "webhook-clusterinstances-siteconfig-open-cluster-management-io"
        $ oc get svc -n open-cluster-management
        <snip>
        webhook-clusterinstances-siteconfig-open-cluster-management-io   ClusterIP   172.30.110.105   <none>        443/TCP    7h40m
        Hub cluster is 3-node cluster. Dual-stack networking w/ ipv4 primary.

        Version-Release number of selected component (if applicable):

      ACM Upgrade from 2.12 to 2.13

      How reproducible:

      100%

      Steps to Reproduce:

      1. Install ACM 2.12
      2. enable SiteConfig Operator
      3. Upgrade to 2.13
      4. Try to create or delete a ClusterInstance CR

      Actual results:

      Error "failed calling webhook"

      Expected results:

      Success in creating/deleting ClusterInstance

      Additional info:

      https://access.redhat.com/solutions/7116347

       

      Resolution

      A hyper-specific check was added to delete deployment/siteconfig-controller-manager only when upgrading from 2.12 and if siteconfig is enabled. QE should only need to test if the upgrade works from 2.12 to 2.13 with siteconfig enabled.

              rh-ee-ngraham Nathaniel Graham
              rhn-support-imiller Ian Miller
              Matthew Smigielski Matthew Smigielski
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: