Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-53166

Component Readiness: [Installer / openshift-installer] [Other] test regressed: 4.19-e2e-gcp-ovn-techpreview tests often got openshift-samples CO unstable

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Moderate
    • Yes
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      install should succeed: cluster operator stability

      Description of problem:

          The 4.19 PROW CI tests "e2e-gcp-ovn-techpreview" / "e2e-gcp-ovn-techpreview-serial" often met cluster operator openshift-samples not stable, where the success rate is less than 30%. While the corresponding 4.17 and 4.18 tests have more than 97% success rate. 

      Version-Release number of selected component (if applicable):

          for example, 4.19.0-0.nightly-2025-03-15-150336

      How reproducible:

          Often (>70%)

      Steps to Reproduce:

          1. enable "featureSet: TechPreviewNoUpgrade" in install-config, then "create cluster"    

      Actual results:

      level=info msg=Waiting up to 30m0s (until 4:21PM UTC) to ensure each cluster operator has finished progressing...
      level=error msg=Error checking cluster operator Progressing status: "context deadline exceeded"
      level=error msg=These cluster operators were not stable: [openshift-samples]

      Expected results:

      The cluster operator openshift-samples should turn stable within the 30 minutes, just as all other cluster operators. 

      Additional info:

      (1) Finally the cluster operator openshift-samples does turn stable, which was after the waiting for 30 minutes timed out. 
      
      (2) "omg get co openshift-samples -o yaml" tells below error condition:
      
        - lastTransitionTime: '2025-03-15T16:25:07Z'
          message: 'Samples installed at 4.19.0-0.nightly-2025-03-15-150336, with image
            import failures for these imagestreams: postgresql,openshift-service-ca.crt,kube-root-ca.crt;
            last import attempt 2025-03-15 15:51:33 +0000 UTC'
          reason: FailedImageImports
          status: 'False'
          type: Progressing
      
      (3) The test history, where we can see frequent failures: 
      
      https://prow.ci.openshift.org/job-history/gs/test-platform-results/logs/periodic-ci-openshift-release-master-ci-4.19-e2e-gcp-ovn-techpreview
      
      https://prow.ci.openshift.org/job-history/gs/test-platform-results/logs/periodic-ci-openshift-release-master-ci-4.19-e2e-gcp-ovn-techpreview-serial
      
      (4) The must-gather of https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-ci-4.19-e2e-gcp-ovn-techpreview/1900927143378948096 is available at https://drive.google.com/file/d/1OTxsm4Hs0SQdTDLL4jVcI0qNrj9K0YHw/view?usp=drive_link

              rhn-support-spoole Shannon Poole
              rhn-support-jiwei Jianli Wei
              None
              None
              Jitendar Singh Jitendar Singh
              None
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: