Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-46598

Evicted Pods owned by Catalogsource are not rescheduled

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • 4.16.z
    • OLM
    • Moderate
    • None
    • Diglett OLM Sprint 264
    • 1
    • False
    • Hide

      None

      Show
      None
    • Olm catalog registry pods that were terminated by the kubelet with reason "TerminationByKubelet" were not being recreated by the catalog operator. This patch includes the fix for it.
    • Bug Fix
    • In Progress

      This is a clone of issue OCPBUGS-46474. The following is the description of the original issue:

      Description of problem:

      For various reasons, Pods may get evicted. Once they are evicted, the owner of the Pod should recreate the Pod so it is scheduled again.

      With OLM, we can see that evicted Pods owned by Catalogsources are not rescheduled. The outcome is that all subscriptions have a "ResolutionFailed=True" condition, which hinders an upgrade of the operator. Specifically the customer is seeing an affected CatalogSource is "multicluster-engine-CENSORED_NAME-redhat-operator-index "in openshift-marketplace namespace, pod name: "multicluster-engine-CENSORED_NAME-redhat-operator-index-5ng9j"

      Version-Release number of selected component (if applicable):

      OpenShift Container Platform 4.16.21

      How reproducible:

      Sometimes, when Pods are evicted on the cluster

      Steps to Reproduce:

      1. Set up an OpenShift Container Platform 4.16 cluster, install various Operators
      2. Create a condition that a Node will evict Pods (for example by creating DiskPressure on the Node)
      3. Observe if any Pods owned by CatalogSources are being evicted

      Actual results:

      If Pods owned by CatalogSources are being evicted, they are not recreated / rescheduled.

      Expected results:

      When Pods owned by CatalogSources are being evicted, they are being recreacted / rescheduled.

      Additional info:

              lmohanty@redhat.com Lalatendu Mohanty
              openshift-crt-jira-prow OpenShift Prow Bot
              Xia Zhao Xia Zhao
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: