Uploaded image for project: 'OCP Technical Release Team'
  1. OCP Technical Release Team
  2. TRT-1682

openshift-e2e-loki image pull problems is incorrectly failing jobs

XMLWordPrintable

    • Icon: Story Story
    • Resolution: Done
    • Icon: Normal Normal
    • None
    • 4.16.0
    • False
    • None
    • False

      Component Readiness has found a potential regression in [Unknown][invariant] alert/KubePodNotReady should not be at or above info in all the other namespaces.

      Probability of significant regression: 98.28%

      Sample (being evaluated) Release: 4.16
      Start Time: 2024-05-17T00:00:00Z
      End Time: 2024-05-23T23:59:59Z
      Success Rate: 93.75%
      Successes: 45
      Failures: 3
      Flakes: 0

      Base (historical) Release: 4.15
      Start Time: 2024-02-01T00:00:00Z
      End Time: 2024-02-28T23:59:59Z
      Success Rate: 100.00%
      Successes: 135
      Failures: 0
      Flakes: 0

      View the test details report at https://sippy.dptools.openshift.org/sippy-ng/component_readiness/test_details?arch=amd64&baseEndTime=2024-02-28%2023%3A59%3A59&baseRelease=4.15&baseStartTime=2024-02-01%2000%3A00%3A00&capability=Other&component=Unknown&confidence=95&environment=sdn%20upgrade-minor%20amd64%20aws%20standard&excludeArches=arm64%2Cheterogeneous%2Cppc64le%2Cs390x&excludeClouds=openstack%2Cibmcloud%2Clibvirt%2Covirt%2Cunknown&excludeVariants=hypershift%2Cosd%2Cmicroshift%2Ctechpreview%2Csingle-node%2Cassisted%2Ccompact&groupBy=cloud%2Carch%2Cnetwork&ignoreDisruption=true&ignoreMissing=false&minFail=3&network=sdn&pity=5&platform=aws&sampleEndTime=2024-05-23%2023%3A59%3A59&sampleRelease=4.16&sampleStartTime=2024-05-17%2000%3A00%3A00&testId=openshift-tests-upgrade%3A57b9d37e7f1d80cb25d3ba4386abc630&testName=%5BUnknown%5D%5Binvariant%5D%20alert%2FKubePodNotReady%20should%20not%20be%20at%20or%20above%20info%20in%20all%20the%20other%20namespaces&upgrade=upgrade-minor&variant=standard

      This problem boils down to openshift-e2e-loki image pull problems causing a KubePodNotReady to be firing at the end of the test run, which counts as a test failure.

      Established precedent is that nothing in Loki infra (specific to our test clusters, not the product) should be able to fail a job.

      We need to grant an exception for this openshift namespace.

      Sippy's test details can be used to check if this problem is still occurring.

              rhn-engineering-dgoodwin Devan Goodwin
              rhn-engineering-dgoodwin Devan Goodwin
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                Resolved: