Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-15200

"Cluster operator X is updating versions" is not a reason for Failing=True condition

XMLWordPrintable

    • No
    • 3
    • OTA 254, OTA 255, OTA 256, OTA 257, OTA 258, OTA 259
    • 6
    • False
    • Hide

      None

      Show
      None
    • Hide
      * Previously, the Cluster Version Operator (CVO) did not filter internal errors that were propogated to the `ClusterVersion Failing` condition message. As a result, errors that did not negatively impact the update were shown for the ClusterVersion Failing condition message. With this release, the errors that are propogated to the "ClusterVersion Failing" condition message are filtered. (link:https://issues.redhat.com/browse/OCPBUGS-15200[*OCPBUGS-15200*])
      Show
      * Previously, the Cluster Version Operator (CVO) did not filter internal errors that were propogated to the `ClusterVersion Failing` condition message. As a result, errors that did not negatively impact the update were shown for the ClusterVersion Failing condition message. With this release, the errors that are propogated to the "ClusterVersion Failing" condition message are filtered. (link: https://issues.redhat.com/browse/OCPBUGS-15200 [* OCPBUGS-15200 *])
    • Bug Fix
    • Done

      Description of problem:

      During the build02 update from 4.14.0-ec.1 to ec.2 I have noticed the following:

      
      $ b02 get clusterversion version -o json | jq '.status.conditions[] | select (.type=="Failing")'
      {
        "lastTransitionTime": "2023-06-20T13:40:12Z",
        "message": "Multiple errors are preventing progress:\n* Cluster operator authentication is updating versions\n* Could not update customresourcedefinition \"alertingrules.monitoring.openshift.io\" (512 of 993): the object is invalid, possibly due to local cluster configuration",
        "reason": "MultipleErrors",
        "status": "True",
        "type": "Failing"
      }
      
      

      There is a valid error (the Could not update customresourcedefinition... one) but the whole thing is cluttered by the "Cluster operator authentication is updating versions" message, which is imo not a legit reason for Failing=True condition and should not be there. Before I captured this one I saw the message with three operators instead of just one.

      Version-Release number of selected component (if applicable):

      4.14.0-ec.2
      

      How reproducible:

      No idea
      

              dhurta@redhat.com David Hurta
              afri@afri.cz Petr Muller
              Dinesh Kumar S Dinesh Kumar S
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

                Created:
                Updated:
                Resolved: