[OCPBUGS-45416] clusteroperator/machine-config blips Degraded=True during upgrade test

    • Release Note Not Required
    • In Progress

      This is a clone of issue OCPBUGS-39199. The following is the description of the original issue:

      Description of problem:

          To ensure that no HA component is degraded by design during normal e2e tests or upgrades, we are collecting every operator that blips Degraded=True during any payload job run.
      
      This card tracks the machine-config operator, which blips Degraded=True during upgrade runs.
      
      
      Example Job: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-ci-4.18-upgrade-from-stable-4.17-e2e-azure-ovn-upgrade/1843023092004163584   
      
      Reasons associated with the blip: RenderConfigFailed   
      
      For now, we have put an exception in the test. However, teams are expected to fix these blips and remove the exceptions once the fixes go in.
      
      Exceptions are defined here: 
      
      
      See linked issue for more explanation on the effort.
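
The exception mechanism described above can be illustrated with a minimal sketch. This is not the actual test code: `ConditionEvent`, `KNOWN_EXCEPTIONS`, and `classify_blips` are hypothetical names, and the only exception entry is the operator/reason pair named in this card.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConditionEvent:
    """One observed ClusterOperator condition change (hypothetical shape)."""
    operator: str
    condition: str
    status: str
    reason: str


# Hypothetical exception table: (operator, reason) pairs tolerated for now.
# The single entry mirrors the blip described in this card.
KNOWN_EXCEPTIONS = {("machine-config", "RenderConfigFailed")}


def classify_blips(events):
    """Split Degraded=True events into (excepted, unexpected) lists."""
    excepted, unexpected = [], []
    for ev in events:
        # Only Degraded=True observations count as blips.
        if ev.condition != "Degraded" or ev.status != "True":
            continue
        if (ev.operator, ev.reason) in KNOWN_EXCEPTIONS:
            excepted.append(ev)
        else:
            unexpected.append(ev)
    return excepted, unexpected
```

In a sketch like this, removing an entry from `KNOWN_EXCEPTIONS` after a fix lands would make the same blip show up as unexpected again, which is the intent of the follow-up work.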

      Version-Release number of selected component (if applicable):

          

      How reproducible:

          

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

          

      Expected results:

          

      Additional info:

          


            Errata Tool added a comment -

            Since the problem described in this issue should be resolved in a recent advisory, it has been closed.

            For information on the advisory (Important: OpenShift Container Platform 4.18.1 bug fix and security update), and where to find the updated files, follow the link below.

            If the solution does not work for you, open a new bug report.
            https://access.redhat.com/errata/RHSA-2024:6122


            Prachiti Talgulkar added a comment -

            No issue seen for 4.19 here, and OCPBUGS-48250 is fixed. Hence moving the status to VERIFIED.

            Prachiti Talgulkar added a comment -

            Hello djoshy, while verifying, the error was seen again in the 4.18-to-4.19 upgrade CI job:
            https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-ci-4.19-upgrade-from-stable-4.18-e2e-aws-ovn-upgrade/1877030047424974848

            Jan 08 18:28:29.232 E clusteroperator/machine-config condition/Degraded reason/RenderConfigFailed status/True Failed to resync 4.18.0-0.ci-2025-01-08-112840 because: timed out: refusing to read images.json version "4.19.0-0.ci-2025-01-08-162046", operator version "4.18.0-0.ci-2025-01-08-112840" (exception: https://issues.redhat.com/browse/MCO-1447)
            Jan 08 18:28:29.232 - 1367s E clusteroperator/machine-config condition/Degraded reason/RenderConfigFailed status/True Failed to resync 4.18.0-0.ci-2025-01-08-112840 because: timed out: refusing to read images.json version "4.19.0-0.ci-2025-01-08-162046", operator version "4.18.0-0.ci-2025-01-08-112840" (exception: outside of upgrade window https://issues.redhat.com/browse/TRT-1575)
            Jan 08 18:51:16.907 W clusteroperator/machine-config condition/Degraded status/False

            Could you please take a look at this? Thank you!
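
For triage, the length of the blip can be read straight from the log lines quoted above. The following is a rough sketch, assuming the line layout shown in this comment; `LINE_RE`, `parse_line`, and `degraded_seconds` are hypothetical helper names, and the year is an assumption taken from the build timestamps because the log lines themselves carry none.

```python
import re
from datetime import datetime

# Matches lines shaped like the three quoted in the comment above.
# The optional "- 1367s" duration field and "reason/..." token are tolerated.
LINE_RE = re.compile(
    r"^(?P<ts>\w{3} \d{2} \d{2}:\d{2}:\d{2}\.\d{3}) "
    r"(?:- \d+s\s+)?"
    r"(?P<level>[EWI]) clusteroperator/(?P<operator>\S+) "
    r"condition/(?P<condition>\S+) "
    r"(?:reason/(?P<reason>\S+) )?"
    r"status/(?P<status>\S+)"
)


def parse_line(line, year):
    """Return (timestamp, condition, status, reason), or None if no match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    # The log omits the year, so the caller supplies one (assumption).
    # Note: %b is locale-dependent; English month names are assumed.
    ts = datetime.strptime(f"{year} {m['ts']}", "%Y %b %d %H:%M:%S.%f")
    return ts, m["condition"], m["status"], m["reason"]


def degraded_seconds(lines, year=2025):
    """Seconds between the first Degraded=True and the next Degraded=False."""
    start = None
    for line in lines:
        parsed = parse_line(line, year)
        if parsed is None:
            continue
        ts, condition, status, _reason = parsed
        if condition != "Degraded":
            continue
        if status == "True" and start is None:
            start = ts
        elif status == "False" and start is not None:
            return (ts - start).total_seconds()
    return None  # no completed blip found
```

Applied to the three lines above, the difference between 18:28:29.232 and 18:51:16.907 is about 1367.7 seconds, which agrees with the `1367s` duration reported in the second line.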

            Prachiti Talgulkar added a comment -

            Fix included in build 4.18.0-0.nightly-2024-12-30-200229.
            Will wait a few days to check that no error is observed for the 4.18 release here.

              team-mco Team MCO
              openshift-crt-jira-prow OpenShift Prow Bot
              Sergio Regidor de la Rosa Sergio Regidor de la Rosa