Type: Bug
Resolution: Unresolved
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.18.z, 4.19.z, 4.21, 4.20.z
Component/s: Networking / router
Labels:

Activity Type:
None
Blocked:
False
Blocked Reason:
None
Story Points:
None
Severity:
Critical
Regression:
None
Architecture:

x86_64
Deployment Environment:
Production

Target Backport Versions:
None
Target Version:

4.22.0
Release Blocker:
Rejected
Sprint:
NI&D Sprint 284
sprint_count:
1

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

The Ingress Router pods are experiencing instability under high load. Specifically, whenever the configured maximum connection limit (maxconn) is saturated, the liveness probe fails and pods terminate and restart instead of gracefully throttling traffic.
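To make the reported failure mode concrete, here is a toy Python model. It is not HAProxy or the router code, and the class and method names are hypothetical. It assumes the liveness check needs a regular connection slot, as the title of the linked PR suggests, and contrasts it with a check over a separate admin channel that does not count against maxconn.

```python
class ToyProxy:
    """Toy model of a proxy with a hard connection limit (not HAProxy itself)."""

    def __init__(self, maxconn):
        self.maxconn = maxconn
        self.active = 0

    def open_connection(self):
        """Take a connection slot if one is free; return True on success."""
        if self.active >= self.maxconn:
            # A real proxy would queue or time out here; the toy just refuses.
            return False
        self.active += 1
        return True

    def close_connection(self):
        """Release one connection slot."""
        if self.active > 0:
            self.active -= 1

    def http_probe(self):
        """Assumed probe style: competes with client traffic for a slot."""
        if not self.open_connection():
            return False
        self.close_connection()
        return True

    def admin_probe(self):
        """Assumed alternative: separate control channel, ignores maxconn."""
        return True


proxy = ToyProxy(maxconn=2)
proxy.open_connection()
proxy.open_connection()  # limit reached
print("http probe ok:", proxy.http_probe())
print("admin probe ok:", proxy.admin_probe())
```

In this model the slot-consuming probe fails as soon as the limit is reached even though the process is healthy, which matches the symptom described above. Whether the real router behaves this way for the same reason is an assumption.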

Version-Release number of selected component (if applicable):

latest

How reproducible:

With an existing route, generate load above maxconn. We generated the load with https://github.com/mparram/test-backend?tab=readme-ov-file#run-in-openshift

Steps to Reproduce:

    1. Install and configure https://github.com/mparram/test-backend?tab=readme-ov-file#run-in-openshift
    2. Set maxconn to 2000 (optional)
    3. Scale the replicas until HAProxy reaches maxconn (this can be checked with the haproxy_frontend_current_sessions metric)
    4. Observe the HAProxy behavior
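The check in step 3 could be sketched as follows. This is a minimal Python sketch, assuming the router metrics have already been scraped in Prometheus text format. The sample text, the `frontend` label and its values, and the helper names `current_sessions` and `saturated_frontends` are hypothetical. Whether maxconn applies per frontend or per process depends on the HAProxy configuration, so treat the comparison as approximate.

```python
import re

# Hypothetical scrape output; real label names and values may differ.
SAMPLE_METRICS = """\
# HELP haproxy_frontend_current_sessions Current number of active sessions.
# TYPE haproxy_frontend_current_sessions gauge
haproxy_frontend_current_sessions{frontend="public"} 2000
haproxy_frontend_current_sessions{frontend="public_ssl"} 150
haproxy_frontend_max_sessions{frontend="public"} 2000
"""

METRIC = "haproxy_frontend_current_sessions"
LINE_RE = re.compile(
    r'^(?P<name>[A-Za-z_:][A-Za-z0-9_:]*)(?:\{(?P<labels>[^}]*)\})?\s+(?P<value>\S+)'
)
LABEL_RE = re.compile(r'frontend="([^"]*)"')


def current_sessions(metrics_text):
    """Return {frontend: sessions} for the current-sessions gauge."""
    sessions = {}
    for line in metrics_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = LINE_RE.match(line)
        if match is None or match.group("name") != METRIC:
            continue  # some other metric
        label = LABEL_RE.search(match.group("labels") or "")
        frontend = label.group(1) if label else ""
        sessions[frontend] = float(match.group("value"))
    return sessions


def saturated_frontends(metrics_text, maxconn):
    """Return the sorted names of frontends at or above maxconn."""
    return sorted(
        name for name, count in current_sessions(metrics_text).items()
        if count >= maxconn
    )


print(saturated_frontends(SAMPLE_METRICS, 2000))
```

Once a frontend shows up in this list, the load is high enough to proceed to step 4.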

Actual results:

The Kubelet restarts the HAProxy container following a liveness probe failure.
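One way to confirm the restarts is to inspect the container statuses in the pod list. The sketch below assumes a pod list in the standard Kubernetes JSON shape, for example from `oc get pods -n openshift-ingress -o json` (the namespace is an assumption). The sample data, pod names, and the helper name `restarted_containers` are hypothetical.

```python
import json

# Hypothetical, trimmed pod list; names and values are made up for illustration.
SAMPLE_POD_LIST = json.loads("""
{
  "items": [
    {
      "metadata": {"name": "router-default-aaaa"},
      "status": {
        "containerStatuses": [
          {
            "name": "router",
            "restartCount": 3,
            "lastState": {"terminated": {"reason": "Error", "exitCode": 137}}
          }
        ]
      }
    },
    {
      "metadata": {"name": "router-default-bbbb"},
      "status": {
        "containerStatuses": [
          {"name": "router", "restartCount": 0, "lastState": {}}
        ]
      }
    }
  ]
}
""")


def restarted_containers(pod_list):
    """Return (pod, container, restarts, reason, exit_code) for restarted containers."""
    found = []
    for pod in pod_list.get("items", []):
        pod_name = pod["metadata"]["name"]
        for status in pod.get("status", {}).get("containerStatuses", []):
            if status.get("restartCount", 0) > 0:
                terminated = status.get("lastState", {}).get("terminated", {})
                found.append((
                    pod_name,
                    status["name"],
                    status["restartCount"],
                    terminated.get("reason"),
                    terminated.get("exitCode"),
                ))
    return found


for row in restarted_containers(SAMPLE_POD_LIST):
    print(row)
```

These fields only show that a container was restarted. The liveness probe failure itself is normally reported in the pod events.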

Expected results:

HAProxy throttles traffic gracefully and the container is not restarted.

Additional info:

is depended on by: OCPBUGS-67219 the router pod restarted in the stress traffic test (Closed)

relates to: NE-2354 Support Prometheus monitoring metric for router max connections (In Progress)

links to: openshift/router#737: OCPBUGS-67161: Replace HTTP backend liveness check with admin socket check

Assignee: Andrey Lebedev

Reporter: Jose Ortiz Padilla

QA Contact: Shudi Li

Need Info From: None

Votes: 0

Watchers: 10

Created: 2025/12/10 10:20 AM

Updated: 2026/02/26 8:01 AM
