Red Hat Advanced Cluster Management / ACM-2524

MCE2.2 - pods in CrashLoopBackOff after operators installation

      Description of problem:

      The redhat-marketplace-catalog and certified-operators-catalog pods go into CrashLoopBackOff after the MCE and hypershift operators are installed.

      $ oc get pods -n clusters-hyper-0 |grep CrashLoopBackOff
      certified-operators-catalog-6d945db8df-j29d4         0/1     CrashLoopBackOff   53 (4m49s ago)   4h11m
      redhat-marketplace-catalog-7d44769c67-4qvzr          0/1     CrashLoopBackOff   53 (4m45s ago)   4h11m
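
A minimal sketch of a stricter filter for the listing above: `grep` matches the string anywhere on the line, while matching on the STATUS column avoids false hits. The helper name `crashlooping_pods` is hypothetical, and the sketch assumes the default `oc get pods` column order (NAME, READY, STATUS, RESTARTS, AGE).

```shell
# Hypothetical helper: print the name and restart count of every pod whose
# STATUS column is CrashLoopBackOff.
# Reads `oc get pods` table output on stdin and assumes the default column
# order: NAME READY STATUS RESTARTS AGE.
crashlooping_pods() {
  awk '$3 == "CrashLoopBackOff" { print $1, $4 }'
}

# Example (requires cluster access):
#   oc get pods -n clusters-hyper-0 --no-headers | crashlooping_pods
```

With the two pods shown above as input, this would print each pod name followed by its restart count (53).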

      The logs of both pods report the same error:

      Error: compute digest: compute hash: write tar: open /tmp/cache/cache: permission denied
      Usage:
        opm serve <source_path> [flags]

      Flags:
            --cache-dir string         if set, sync and persist server cache directory
            --cache-only               sync the serve cache and exit without serving
            --debug                    enable debug logging
        -h, --help                     help for serve
        -p, --port string              port number to serve on (default "50051")
        -t, --termination-log string   path to a container termination log file (default "/dev/termination-log")

      Global Flags:
            --skip-tls-verify   skip TLS certificate verification for container image registries while pulling bundles
            --use-http          use plain HTTP for container image registries while pulling bundles
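
Only the first line of the log output above is the actual failure; the rest is the usage text that `opm serve` prints when it exits. A small sketch for isolating the error line when collecting logs from several pods follows. The helper name `opm_errors` is hypothetical, and it assumes the error line always starts with `Error:` as in the log above.

```shell
# Hypothetical helper: keep only the "Error:" lines from opm output on stdin,
# stripping leading whitespace and dropping the usage/flags text.
opm_errors() {
  sed -n -E 's/^[[:space:]]*(Error:.*)$/\1/p'
}

# Example (requires cluster access; --previous reads the log of the last
# crashed container instance):
#   oc logs certified-operators-catalog-6d945db8df-j29d4 -n clusters-hyper-0 --previous | opm_errors
```

Because the failure is a permission error on /tmp/cache/cache, a reasonable next check (not performed in this report) would be how that path is mounted and which user the container runs as, for example by inspecting `oc get deployment certified-operators-catalog -n clusters-hyper-0 -o yaml`.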

      $ oc describe hostedclusters hyper-0 -n clusters
          Message:               [certified-operators-catalog deployment has 1 unavailable replicas, redhat-marketplace-catalog deployment has 1 unavailable replicas]
          Observed Generation:   2
          Reason:                UnavailableReplicas
          Status:                True
          Type:                  Degraded
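
The Degraded message above can also be read directly, without scanning the full `oc describe` output. The sketch below assumes the conditions shown above live under `.status.conditions` of the HostedCluster resource; the helper name `degraded_message` is hypothetical.

```shell
# Hypothetical helper: print the message of the Degraded condition of a
# HostedCluster. Assumes the condition list is at .status.conditions.
#   $1 = HostedCluster name, $2 = namespace
degraded_message() {
  oc get hostedclusters "$1" -n "$2" \
    -o jsonpath='{.status.conditions[?(@.type=="Degraded")].message}'
}

# Example (requires cluster access):
#   degraded_message hyper-0 clusters
```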

      Version-Release number of selected component (if applicable):

      OCP 4.12.0-0.nightly-2022-12-26-191545
      multicluster-engine-mce-operator-bundle:v2.2.0-245

      How reproducible:

      100%

      Steps to Reproduce:

      1. Install the hub cluster (3 masters)
      2. Install the MCE and hypershift operators
      3. ...

      Actual results:

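      Both catalog pods stay in CrashLoopBackOff and the hosted cluster hyper-0 is reported as Degraded (see above).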

      Expected results:

      Both catalog pods start successfully and no problems are reported.

      Additional info:

      The problem started happening after noon on 26.12.
      must-gather: http://rhos-compute-node-10.lab.eng.rdu2.redhat.com/logs/ACM-2524-must-gather.tar.gz

              rokejungrh Roke Jung
              lshilin Lubov Shilin
              David Huynh David Huynh