OCPBUGS-16411 (OpenShift Bugs)

awsendpointservice stuck deleting due to missing hosted zone

    • Moderate
    • No
    • SREP Team Rocket 239
    • 1
    • False

      This is a clone of issue OCPBUGS-16160. The following is the description of the original issue:

      This is a clone of issue OCPBUGS-16135. The following is the description of the original issue:

      Description of problem:

      The control-plane-operator pod gets stuck deleting an awsendpointservice if its Route53 hosted zone is already gone:

      Logs:

      {"level":"error","ts":"2023-07-13T03:06:58Z","msg":"Reconciler error","controller":"awsendpointservice","controllerGroup":"hypershift.openshift.io","controllerKind":"AWSEndpointService","aWSEndpointService":{"name":"private-router","namespace":"ocm-staging-24u87gg3qromrf8mg2r2531m41m0c1ji-diegohcp-west2"},"namespace":"ocm-staging-24u87gg3qromrf8mg2r2531m41m0c1ji-diegohcp-west2","name":"private-router","reconcileID":"59eea7b7-1649-4101-8686-78113f27567d","error":"failed to delete resource: NoSuchHostedZone: No hosted zone found with ID: Z05483711XJV23K8E97HK\n\tstatus code: 404, request id: f8686dd6-a906-4a5e-ba4a-3dd52ad50ec3","stacktrace":"sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem\n\t/hypershift/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:273\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2\n\t/hypershift/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:234"} 

      Version-Release number of selected component (if applicable):

      4.12.24

      How reproducible:

      Not yet reproduced, but it should be fairly easy to reproduce with the steps below

      Steps to Reproduce:

      1. Install a PublicAndPrivate or Private HCP
      2. Delete the Route53 Hosted Zone defined in its awsendpointservice's .status.dnsZoneID field
      3. Observe the control-plane-operator looping on the above logs and the uninstall hanging
      

      Actual results:

      Uninstall hangs due to CPO being unable to delete the awsendpointservice

      Expected results:

      The awsendpointservice cleans up. If the hosted zone is already gone, the CPO should not fail the deletion because it cannot find that zone.
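
      One possible shape for the expected behaviour is sketched below in Go. It is not the control-plane-operator's actual code: codedError, apiError, isNoSuchHostedZone and cleanupRecords are hypothetical names introduced for illustration, and the real reconciler would inspect the error returned by the AWS SDK rather than a local type. The only detail taken from this report is the NoSuchHostedZone error code shown in the log above. The idea is that a missing hosted zone during deletion means there is nothing left to clean up, so the error is treated as success, while any other error is still returned.

```go
package main

import (
	"errors"
	"fmt"
)

// codeNoSuchHostedZone is the error code that appears in the log in this report.
const codeNoSuchHostedZone = "NoSuchHostedZone"

// codedError is a hypothetical stand-in for an SDK error that exposes a
// service error code. It is an assumption made for this sketch.
type codedError interface {
	error
	Code() string
}

// apiError is a hypothetical concrete error type used only in this sketch.
type apiError struct {
	code string
	msg  string
}

func (e *apiError) Error() string { return e.code + ": " + e.msg }
func (e *apiError) Code() string  { return e.code }

// isNoSuchHostedZone reports whether err, or any error it wraps, carries
// the NoSuchHostedZone code.
func isNoSuchHostedZone(err error) bool {
	var ce codedError
	return errors.As(err, &ce) && ce.Code() == codeNoSuchHostedZone
}

// cleanupRecords is a hypothetical cleanup step. It calls del for the given
// zone and treats a missing hosted zone as "already cleaned up". Any other
// error is wrapped and returned so the caller can retry.
func cleanupRecords(zoneID string, del func(zoneID string) error) error {
	err := del(zoneID)
	if err == nil || isNoSuchHostedZone(err) {
		return nil
	}
	return fmt.Errorf("failed to delete resource: %w", err)
}

func main() {
	// Simulates the situation in this report: the zone no longer exists.
	gone := func(string) error {
		return fmt.Errorf("delete failed: %w",
			&apiError{code: codeNoSuchHostedZone, msg: "No hosted zone found"})
	}
	// Simulates an unrelated failure that must still be reported.
	denied := func(string) error {
		return &apiError{code: "AccessDenied", msg: "not authorized"}
	}

	fmt.Println("zone gone, error:", cleanupRecords("Z-EXAMPLE", gone))
	fmt.Println("access denied, error:", cleanupRecords("Z-EXAMPLE", denied))
}
```

      Matching on the error code with errors.As, rather than on the message text, keeps the check working when the error has been wrapped with extra context before it reaches the reconciler.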

      Additional info:

       

              mshen.openshift Michael Shen
              openshift-crt-jira-prow OpenShift Prow Bot
              Jie Zhao Jie Zhao