[OCPBUGS-38078] Abnormal values for 'router.openshift.io/haproxy.health.check.interval' annotation breaks the router-default pods - Red Hat Issue Tracker

Type: Bug
Resolution: Unresolved
Priority: Major
Fix Version/s: None
Affects Version/s: 4.14, 4.15, 4.16
Component/s: Networking / router
Labels:
- ne-triaged
- pre-merge-verify

Severity:
Moderate
Regression:
None
Story Points:
3
Sprint:
NE Sprint 257, NE Sprint 258, NE Sprint 259, NE Sprint 260, NE Sprint 261, NE Sprint 262, NE Sprint 263, NE Sprint 264, NE Sprint 265
sprint_count:
9
Release Blocker:
Rejected
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Release Note Text:

Hide
*Cause*: No out of bounds validation for the router.openshift.io/haproxy.health.check.interval annotation allows to set its value to one exceeding the maximum handled by HAProxy.
*Consequence*: The timer overflow alert messages are reported and the router-default pod never reaches the ready state.
*Fix*: Validate router.openshift.io/haproxy.health.check.interval annotation value to ensure it is within the range that HAProxy can parse, effectively capping the value at 2147483647 ms (~24.8 days).
*Result*: router.openshift.io/haproxy.health.check.interval annotation is set to a value that can be parsed by HAProxy.

Show
*Cause*: No out of bounds validation for the router.openshift.io/haproxy.health.check.interval annotation allows to set its value to one exceeding the maximum handled by HAProxy. *Consequence*: The timer overflow alert messages are reported and the router-default pod never reaches the ready state. *Fix*: Validate router.openshift.io/haproxy.health.check.interval annotation value to ensure it is within the range that HAProxy can parse, effectively capping the value at 2147483647 ms (~24.8 days). *Result*: router.openshift.io/haproxy.health.check.interval annotation is set to a value that can be parsed by HAProxy.
Release Note Type:
Bug Fix
Release Note Status:
In Progress
Target Version:

4.19

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:
PX Priority Data:
PX Review Complete:
PX Technical Impact Notes:

Description of problem:

There is no clipValue function for the annotation router.openshift.io/haproxy.health.check.interval. Once any value with abnormal values, the router-default starts to report the following messages:

[ALERT]    (50) : config : [/var/lib/haproxy/conf/haproxy.config:13791] : 'server be_secure:xxx:httpd-gateway-route/pod:xxx:xxx-gateway-service:pass-through-https:10.129.xx.xx:8243' : timer overflow in argument <50000d> to <inter> of server pod:xxx:xxx:pass-through-https:10.129.xx.xx:8243, maximum value is 2147483647 ms (~24.8 days)..

In the above case, the value 50000d was passed to the route annotation router.openshift.io/haproxy.health.check.interval accidentally

Version-Release number of selected component (if applicable):

How reproducible:

Easily

Steps to Reproduce:

1. Run the following script and this will break the cluster

oc get routes -A | awk '{print $1 " " $2}' | tail -n+2 | while read line; do    
 read -r namespace routename <<<$(echo $line)   echo -n "NS: $namespace | "   echo "ROUTENAME: $routename"   
 CMD="oc annotate route -n $namespace $routename --overwrite router.openshift.io/haproxy.health.check.interval=50000d"   
 echo "Annotating route with:"   
 echo $CMD ; eval "$CMD"  
 echo "---" 
done

Actual results:

    The alert messages are reported and the router-default pod never reaches the ready state.

Expected results:

    Clip the value in order to prevent the issue

Additional info:

is related to

OCPBUGS-6958 Route 'haproxy.router.openshift.io/timeout' value is not validated

Closed

links to

openshift/router#618: OCPBUGS-38078: Validate HAProxy health check interval time value

RHEA-2024:11038 OpenShift Container Platform 4.19.z bug fix update

Values higher than 24d in route annotation 'router.openshift.io/haproxy.health.check.interval' affects the ingress routers operation

Assignee:: Grzegorz Piotrowski

Reporter:: Bruno Gomes

QA Contact:: Ishmam Amin

Votes:: 0 Vote for this issue

Watchers:: 10 Start watching this issue

Created:: 2024/08/07 10:14 AM

Updated:: 2025/04/22 7:05 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

Hide