Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 4.12.z
Affects Version/s: 4.6
Component/s: Cloud Compute / OpenStack Provider
Labels:
- QA-Triaged
- Triaged

Test Coverage:

-
Regression:
None
Sprint:
ShiftStack Sprint 225, ShiftStack Sprint 226, ShiftStack Sprint 227, ShiftStack Sprint 228
sprint_count:
4
Release Blocker:
Rejected
Blocked:
False
Blocked Reason:

None

Release Note Text:

Previously, the `Keepalived` health check read the status of a load-balanced `kube-apiserver`. This could cause issues when a cluster recovered from an outage and the API server was unreliable, because the health check could not find a healthy `kube-apiserver`.

For the {product-title} {product-version} release, the `Keepalived` router locates a functioning `HAProxy` router and then passes the health check operation of a `kube-apiserver` to this `HAProxy` router. This change prevents unnecessary API Virtual IP (VIP) failovers.

(link:https://issues.redhat.com/browse/OCPBUGS-1257[*OCPBUGS-1257*])

Release Note Type:
Bug Fix
Release Note Status:
Done
Target Version:

4.12.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

This relates to the recovery of a cluster following an etcd outage.

The ingress path to kube-apiserver is:

───────────> VIP ─────────────────> Local HAProxy ────┬─> kube-apiserver-master-0
    (managed by keepalived)                           │
                                                      ├─> kube-apiserver-master-1
                                                      │
                                                      └─> kube-apiserver-master-2

Each master is running an HAProxy which load balances between the 3 kube-apiservers. Each HAProxy is running health checks against each kube-apiserver, and will add or remove it from the available pool based on its health.
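
As a rough illustration of the pool maintenance described above, the following Python sketch models one HAProxy instance filtering its backends by their latest health-check result. The `Backend` type, the `healthy_pool` helper, and the sample results are hypothetical names chosen for illustration; HAProxy does this internally, driven by its own configuration, not by code like this.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Backend:
    name: str
    healthy: bool  # assumed: outcome of the most recent health check


def healthy_pool(backends: list[Backend]) -> list[str]:
    """Return the names of the backends that should receive traffic."""
    return [b.name for b in backends if b.healthy]


# Hypothetical snapshot: master-1 is currently failing its health check.
backends = [
    Backend("kube-apiserver-master-0", healthy=True),
    Backend("kube-apiserver-master-1", healthy=False),
    Backend("kube-apiserver-master-2", healthy=True),
]
pool = healthy_pool(backends)
print(pool)
```

The point of the sketch is that backend selection already happens at this layer, so nothing above it needs to repeat the same test.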

We only use keepalived to ensure that HAProxy is not a single point of failure. It is the job of keepalived to ensure that incoming traffic is being directed to an HAProxy which is functioning correctly.

The current health check we are using for keepalived involves polling /readyz against the local HAProxy. While this seems intuitively correct, it is in fact testing the wrong thing: it tests whether the kube-apiserver it connects to is functioning correctly. However, this is not the purpose of keepalived. HAProxy runs health checks against kube-apiserver backends. keepalived simply selects a correctly functioning HAProxy.

This becomes important during recovery from an outage. When none of the kube-apiservers are healthy, this health check will fail continuously, and the API VIP will move uselessly between masters. However, the situation is much worse when only one of the kube-apiservers is up. In this case there is a high probability that it is overloaded and at least rate limiting incoming connections. This may lead us to fail the keepalived health check and fail the VIP over to the next HAProxy. This will cause all open kube-apiserver connections to reset, even the established ones. This increases the load on the kube-apiserver and increases the probability that the health check will fail again.
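
The feedback loop described above can be sketched with a deliberately simplified, deterministic toy model. Everything in it is an assumption made for illustration: the `simulate` function, the load units (100 meaning "at capacity"), and the increments are arbitrary and not measured from any cluster. In reality the check fails only with some probability; here an overloaded kube-apiserver always fails it.

```python
def simulate(check_depends_on_apiserver: bool, rounds: int = 5) -> tuple[int, int]:
    """Toy model of one surviving kube-apiserver during recovery.

    Returns (number of VIP failovers, final load). All numbers are
    arbitrary illustration values, not measurements.
    """
    load = 120  # assumed starting point: above 100 means overloaded
    failovers = 0
    for _ in range(rounds):
        apiserver_ok = load <= 100
        # The old-style check inherits the kube-apiserver's health;
        # a check that only looks at HAProxy passes regardless.
        check_passes = apiserver_ok if check_depends_on_apiserver else True
        if check_passes:
            load = max(50, load - 10)  # connections stay up, load drains
        else:
            failovers += 1
            load += 20  # VIP moves, every connection resets and reconnects
    return failovers, load


old = simulate(check_depends_on_apiserver=True)
new = simulate(check_depends_on_apiserver=False)
print("check tied to kube-apiserver:", old)
print("check tied to HAProxy only:  ", new)
```

Under these assumptions the check that depends on the kube-apiserver never recovers, because each failover adds more load than a quiet round removes, while the HAProxy-only check lets the load drain.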

Ideally the keepalived health check would check only the health of HAProxy itself, not the health of the pool of kube-apiservers. In practice it will probably never be necessary to move the VIP while the master is up, regardless of the health of the cluster. A network partition affecting HAProxy would already be handled by VRRP between the masters, so it may be that it would be sufficient to check that the local HAProxy pod is healthy.
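
A minimal sketch of the difference between the two checks, assuming HAProxy exposes some health endpoint of its own. The URLs, ports, and function names below are hypothetical placeholders, not the values used by the actual fix, and the probe is injected so the example runs without a cluster.

```python
from typing import Callable

# A probe takes a URL and returns True if it answered successfully.
Probe = Callable[[str], bool]

# Hypothetical endpoints, for illustration only.
READYZ_VIA_HAPROXY = "https://localhost:9445/readyz"    # served by a kube-apiserver
HAPROXY_SELF_CHECK = "http://localhost:9444/haproxy-health"  # served by HAProxy itself


def current_check(probe: Probe) -> bool:
    """Passes only if a kube-apiserver answers /readyz through HAProxy."""
    return probe(READYZ_VIA_HAPROXY)


def proposed_check(probe: Probe) -> bool:
    """Passes whenever the local HAProxy itself reports healthy."""
    return probe(HAPROXY_SELF_CHECK)


def fake_probe(url: str) -> bool:
    """Simulated outage: HAProxy is up, no kube-apiserver is ready."""
    return url == HAPROXY_SELF_CHECK


current_result = current_check(fake_probe)
proposed_result = proposed_check(fake_probe)
print("current check passes: ", current_result)
print("proposed check passes:", proposed_result)
```

In the simulated outage the current check fails and would trigger a failover, while the proposed check passes and leaves the VIP where it is.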

is cloned by

OCPBUGS-4605 Keepalived health check causes unnecessary VIP flapping when HAProxy is healthy

Closed

is depended on by

OCPBUGS-4605 Keepalived health check causes unnecessary VIP flapping when HAProxy is healthy

Closed

links to

openshift/machine-config-operator#3339: OCPBUGS-1257: Have keepalived check for haproxy status for API VIP

Assignee: Martin André

Reporter: Matthew Booth

QA Contact: Ramón Lobillo

Doc Contact: Darragh Fitzmaurice

Votes: 1

Watchers: 10

Created: 2022/09/13 4:22 PM

Updated: 2024/02/15 2:36 PM

Resolved: 2023/01/17 7:39 PM
