Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-29989

Fail to install operator on Provider mode cluster with ACM release-2.9

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Won't Do
    • Icon: Normal Normal
    • None
    • 4.14, 4.14.z
    • OLM
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Moderate
    • No
    • None
    • None
    • Rejected
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      Fail to deploy operator on Provider mode cluster with ACM release-2.9 
      The goal was to deploy ODF operator on hosted Client on OCP 4.14.12.
      There are multiple OCP Hosted clusters and all of them where failing to deploy ODF 4.14.5-8 Operator. 
      
      Subscription has CatalogSourceUnhealthy msg
      
      - lastTransitionTime: "2024-02-26T15:21:56Z"
          message: all available catalogsources are healthy
          reason: AllCatalogSourcesHealthy
          status: "False"
          type: CatalogSourcesUnhealthy
        - message: 'failed to populate resolver cache from source ocs-catalogsource/openshift-marketplace:
           error encountered while listing bundles: rpc error: code = DeadlineExceeded
            desc = context deadline exceeded'
          reason: ErrorPreventedResolution
          status: "True"
          type: ResolutionFailedAlthough the catalogsource is in 'READY'kind: CatalogSource
      metadata:
        creationTimestamp: "2024-02-26T15:21:44Z"
        generation: 1
        labels:
          ocs-operator-internal: "true"
        name: ocs-catalogsource
        namespace: openshift-marketplace
        resourceVersion: "212287"
        uid: 82d1a231-24dc-4c89-bcce-0f41f5c49d32
      spec:
        displayName: Openshift Container Storage
        image: quay.io/rhceph-dev/ocs-registry:4.14.5-8
        priority: 100
        publisher: Red Hat
        sourceType: grpc
      status:
        connectionState:
          address: ocs-catalogsource.openshift-marketplace.svc:50051
          lastConnect: "2024-02-27T0427Z"
          lastObservedState: READY
      
      downloads pod is in CLBO 
      
      openshift-console                                 downloads-787795d45b-knh22                               0/1    CrashLoopBackOff  51 (3m11s ago)  161m
      
      pull-secret and iscp were checked, image of the operator can be pulled with same pull-secred using podman
      
      ------ logs from olm-operator and catalog source -------
      
      
      # k logs -nclusters-hcp414-bm2-a olm-operator-f8b774fd-jwb4j --since 2m
      Defaulted container "olm-operator" out of: olm-operator, socks5-proxy, availability-prober (init)
      time="2024-02-27T10:35:22Z" level=error msg="initialization error - failed to ensure name=\"\" - ClusterOperator.config.openshift.io \"\\\"\\\"\" is invalid: metadata.name: Invalid value: \"\\\"\\\"\": a lowercase RFC 1123 subdomain must consist of lower case alphanumeric characters, '-' or '.', and must start and end with an alphanumeric character (e.g. 'example.com', regex used for validation is '[a-z0-9]([-a-z0-9]*[a-z0-9])?(\\.[a-z0-9]([-a-z0-9]*[a-z0-9])?)*')" monitor=clusteroperator
      time="2024-02-27T10:36:22Z" level=error msg="initialization error - failed to ensure name=\"\" - ClusterOperator.config.openshift.io \"\\\"\\\"\" is invalid: metadata.name: Invalid value: \"\\\"\\\"\": a lowercase RFC 1123 subdomain must consist of lower case alphanumeric characters, '-' or '.', and must start and end with an alphanumeric character (e.g. 'example.com', regex used for validation is '[a-z0-9]([-a-z0-9]*[a-z0-9])?(\\.[a-z0-9]([-a-z0-9]*[a-z0-9])?)*')" monitor=clusteroperator# k logs -nclusters-hcp414-bm2-a catalog-operator-7dfd5c59dd-2hqjj --since 2m | grep client
      Defaulted container "catalog-operator" out of: catalog-operator, socks5-proxy, availability-prober (init)
      E0227 10:35:58.032747       1 queueinformer_operator.go:319] sync "openshift-storage-client" failed: failed to populate resolver cache from source ocs-catalogsource/openshift-marketplace: error encountered while listing bundles: rpc error: code = DeadlineExceeded desc = context deadline exceeded
      I0227 10:35:58.032993       1 event.go:298] Event(v1.ObjectReference{Kind:"Namespace", Namespace:"", Name:"openshift-storage-client", UID:"8d343fd2-22d6-4bdf-8efb-3a6d0e05ecb0", APIVersion:"v1", ResourceVersion:"222051", FieldPath:""}): type: 'Warning' reason: 'ResolutionFailed' failed to populate resolver cache from source ocs-catalogsource/openshift-marketplace: error encountered while listing bundles: rpc error: code = DeadlineExceeded desc = context deadline exceeded
      E0227 10:36:58.032947       1 queueinformer_operator.go:319] sync {"update" "openshift-storage-client"} failed: failed to populate resolver cache from source ocs-catalogsource/openshift-marketplace: error encountered while listing bundles: rpc error: code = DeadlineExceeded desc = context deadline exceeded
      I0227 10:36:58.033116       1 event.go:298] Event(v1.ObjectReference{Kind:"Namespace", Namespace:"", Name:"openshift-storage-client", UID:"8d343fd2-22d6-4bdf-8efb-3a6d0e05ecb0", APIVersion:"v1", ResourceVersion:"222051", FieldPath:""}): type: 'Warning' reason: 'ResolutionFailed' failed to populate resolver cache from source ocs-catalogsource/openshift-marketplace: error encountered while listing bundles: rpc error: code = DeadlineExceeded desc = context deadline exceeded

       

      Version-Release number of selected component (if applicable):

        acm_version: "2.9"
        acm_hub_channel: "release-2.9"
      

      How reproducible:

          Deploy Provider mode cluster using Bare Metal machines, create kubevirt cluster using CNV 4.14 

      Steps to Reproduce:

          1. Deploy Provider mode cluster using Bare Metal machines     
          2. create kubevirt cluster using CNV 4.14 
          3. apply yaml files to install ODF:
      ---
      apiVersion: v1
      kind: Namespace
      metadata:
        name: openshift-storage-client
      ---
      apiVersion: operators.coreos.com/v1
      kind: OperatorGroup
      metadata:
        name: openshift-storage-client-operator-group
        namespace: openshift-storage-client
      spec:
        targetNamespaces:
          - openshift-storage-client
      ---
      apiVersion: operators.coreos.com/v1alpha1
      kind: Subscription
      metadata:
        name: ocs-client-operator
        namespace: openshift-storage-client
      spec:
        channel: stable-4.14
        installPlanApproval: Automatic
        name: ocs-client-operator
        source: ocs-catalogsource
        sourceNamespace: openshift-marketplace
      ---
      apiVersion: operators.coreos.com/v1alpha1
      kind: CatalogSource
      metadata:
        name: ocs-catalogsource
        namespace: openshift-marketplace
        labels:
          ocs-operator-internal: "true"
      spec:
        displayName: Openshift Container Storage
        image: quay.io/rhceph-dev/ocs-registry:4.14.5-8
        publisher: Red Hat
        sourceType: grpc
        priority: 100
       
          

      Actual results:

          download pod in CLBO, no CSV's available

      Expected results:

          ODF installed and operational

      Additional info:

      discussion on slack channel https://ibm-systems-storage.slack.com/archives/C05RJB6H0LQ/p1708954071299569

              lmohanty@redhat.com Lalatendu Mohanty
              rh-ee-dosypenk Daniel Osypenko
              None
              None
              Jian Zhang Jian Zhang
              None
              Amrita Mahapatra, Leela Gangavarapu, Neha Berry, suchita gatfane
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

                Created:
                Updated:
                Resolved: