Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-74416

Component Readiness: [Etcd] [Alerts] test regressed

    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • Rejected
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      (Feel free to update this bug's summary to be more specific.)
      Component Readiness has found a potential regression in the following test:

      [Monitor:legacy-test-framework-invariants-alerts][bz-etcd][invariant] alert/etcdHighCommitDurations should not be at or above info

      Significant regression detected.
      Fishers Exact probability of a regression: 100.00%.
      Test pass rate dropped from 100.00% to 86.84%.

      Sample (being evaluated) Release: 4.22
      Start Time: 2026-01-19T00:00:00Z
      End Time: 2026-01-26T12:00:00Z
      Success Rate: 86.84%
      Successes: 33
      Failures: 5
      Flakes: 0
      Base (historical) Release: 4.20
      Start Time: 2025-09-21T00:00:00Z
      End Time: 2025-10-21T00:00:00Z
      Success Rate: 100.00%
      Successes: 104
      Failures: 0
      Flakes: 0

      View the test details report for additional context.

      Pattern here indicates etcd is really struggling. The masters in this job are using m5.2xlarge, we compared to a similar standard OCP job on AWS and found we're using m6a.xlarge, which is half the vcpus and RAM but was discovered to have half the EBS bandwidth, which would be a direct problem for etcd. We recommend moving to faster disk bandwidth to remedy this problem.

      Filed by: dgoodwin@redhat.com

              rh-ee-alesross Alessandro Rossi
              openshift-trt OpenShift Technical Release Team
              None
              None
              None
              None
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: