Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-38611

The Catalog Operator attempts to connect to deleted catalogSources

XMLWordPrintable

    • Important
    • No
    • False
    • Hide

      None

      Show
      None

      This is a clone of issue OCPBUGS-38290. The following is the description of the original issue:

      Description of problem:

      OLM still check the deleted catsrc of openshift-marketplace

      Version-Release number of selected component (if applicable):

      4.13

      How reproducible:

      not always

      Steps to Reproduce:

      https://gcsweb-qe-private-deck-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/qe-private-deck/logs/periodic-ci-openshift-openshift-tests-private-release-4.13-amd64-nightly-gcp-ipi-sdn-p1-f7/1632127504539979776/artifacts/gcp-ipi-sdn-p1-f7/openshift-extended-test/build-log.txt

       

      In daily CI, we met this issue several times.
      for example:
      https://gcsweb-qe-private-deck-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/qe-private-deck/logs/periodic-ci-openshift-openshift-tests-private-release-4.13-amd64-nightly-gcp-ipi-sdn-p1-f7/1632127504539979776/artifacts/gcp-ipi-sdn-p1-f7/openshift-extended-test/build-log.txt
      
      prometheus-dependency1-cs has been deleted, but many sub are installed failed due to ErrorPreventedResolution.
      
      "message": "failed to populate resolver cache from source prometheus-dependency1-cs/openshift-marketplace: failed to list bundles: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing dial tcp: lookup prometheus-dependency1-cs.openshift-marketplace.svc on 172.30.0.10:53: no such host\"",
                      "reason": "ErrorPreventedResolution",
                      "status": "True",
                      "type": "ResolutionFailed"
      
      2023-03-04T22:35:00.761837299Z time="2023-03-04T22:35:00Z" level=info msg="removed client for deleted catalogsource" source="{prometheus-dependency1-cs openshift-marketplace}"
      
       4114 2023-03-04T22:39:38.039489890Z E0304 22:39:38.039410       1 queueinformer_operator.go:298] sync "e2e-test-olm-a-fa98jfef-sxnxr" failed: failed to populate resolver cach      e from source prometheus-dependency1-cs/openshift-marketplace: failed to list bundles: rpc error: code = Unavailable desc = connection error: desc = "transport: Error wh      ile dialing dial tcp: lookup prometheus-dependency1-cs.openshift-marketplace.svc on 172.30.0.10:53: no such host"

      Actual results:

      The deleted catsrc impacts sub installation.

      Expected results:

      The deleted catsrc should not impact sub installation.

      Additional info:

       

            lmohanty@redhat.com Lalatendu Mohanty
            anik120 Anik Bhattacharjee
            Jian Zhang Jian Zhang
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: