OpenShift Bugs / OCPBUGS-60204

[release-4.19] CI fails on testGatewayAPIIstioInstallation because Istiod has too many pods


    • Quality / Stability / Reliability
    • Low
    • Rejected
    • NI&D Sprint 276
    • Proposed
    • Bug Fix
    • Before this update, the `HorizontalPodAutoscaler` temporarily scaled the `istiod-openshift-gateway` deployment to two replicas, causing continuous integration (CI) failures because the tests expected exactly one replica. With this release, the check verifies that the `istiod-openshift-gateway` deployment has at least one replica, so temporary `HorizontalPodAutoscaler` scaling no longer causes failures. (link:https://issues.redhat.com/browse/OCPBUGS-60204[OCPBUGS-60204])

      This is a clone of issue OCPBUGS-59894. The following is the description of the original issue:

      Description of problem

      CI can fail because of test failures such as the following:

          gateway_api_test.go:158: failed to find expected Istiod control plane: too many pods for deployment openshift-ingress/istiod-openshift-gateway: 2
      

      This failure comes from https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-ingress-operator/1245/pull-ci-openshift-cluster-ingress-operator-master-e2e-aws-operator/1949881963301048320.
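
      For illustration only, a more tolerant version of this check could accept any positive pod count rather than exactly one. The helper below is a hypothetical sketch, not the actual gateway_api_test.go code:

```go
package main

import "fmt"

// verifyIstiodPodCount is a hypothetical sketch of a relaxed control-plane
// check: rather than failing when the istiod deployment has more than one
// pod (for example, during temporary HPA scale-out), it only requires that
// at least one pod exists.
func verifyIstiodPodCount(deployment string, podCount int) error {
	if podCount < 1 {
		return fmt.Errorf("failed to find expected Istiod control plane: no pods for deployment %s", deployment)
	}
	return nil
}

func main() {
	// Two pods (HPA scaled out) passes; zero pods still fails.
	fmt.Println(verifyIstiodPodCount("openshift-ingress/istiod-openshift-gateway", 2))
	fmt.Println(verifyIstiodPodCount("openshift-ingress/istiod-openshift-gateway", 0))
}
```

      Under this relaxed condition, the transient scale-out to 2 replicas described above would not fail the test.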

      Version-Release number of selected component (if applicable)

      I have seen this in 4.20.

      How reproducible

      I have only seen it happen once.

      Steps to Reproduce

      1. Post a PR and have bad luck.

      Actual results

      CI fails.

      Expected results

      CI passes, or fails on some other test failure.

      Additional info

      The failure occurred because HPA scaled istiod out temporarily to 2 replicas. I found the following event in the must-gather archive for the referenced CI run:

      apiVersion: v1
      count: 1
      eventTime: null
      firstTimestamp: "2025-07-28T19:10:42Z"
      involvedObject:
        apiVersion: autoscaling/v2
        kind: HorizontalPodAutoscaler
        name: istiod-openshift-gateway
        namespace: openshift-ingress
        resourceVersion: "86760"
        uid: 112b10f4-bad3-433f-9bb0-f0c1ca333e06
      kind: Event
      lastTimestamp: "2025-07-28T19:10:42Z"
      message: 'New size: 2; reason: cpu resource utilization (percentage of request)
        above target'
      metadata:
        creationTimestamp: "2025-07-28T19:10:42Z"
        managedFields:
        # ...
        name: istiod-openshift-gateway.18567fffeaf8b275
        namespace: openshift-ingress
        resourceVersion: "86914"
        uid: e3f5d32f-9370-46be-ae56-934293cf68f7
      reason: SuccessfulRescale
      reportingComponent: horizontal-pod-autoscaler
      reportingInstance: ""
      source:
        component: horizontal-pod-autoscaler
      type: Normal
      

      We can consider turning off HPA, but it isn't clear why the test expects the number of pod replicas to be exactly 1.
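
      If turning off HPA scale-out were pursued, one illustrative approach (an assumption, not the operator's actual manifest) is to pin the autoscaler's minimum and maximum replicas to the same value so it can never scale out:

```yaml
# Hypothetical HorizontalPodAutoscaler pinned to a single replica.
# With minReplicas == maxReplicas, the HPA cannot scale istiod out,
# regardless of CPU utilization. Illustrative only; field values other
# than the name/namespace taken from the event above are assumptions.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: istiod-openshift-gateway
  namespace: openshift-ingress
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: istiod-openshift-gateway
  minReplicas: 1
  maxReplicas: 1
```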

              rh-ee-iamin Ishmam Amin
              mmasters1@redhat.com Miciah Masters
              Votes: 0
              Watchers: 7