Loading...

XML

Word

Printable

Type: Task
Resolution: Obsolete
Priority: Major
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Story Points:
5
Blocked:
False
Ready:
False
Release Note Text:
Undefined
Market:

Sprint:
SDN Sprint 203, SDN Sprint 204
Cost of Delay:
0
WSJF:
0.0

SFDC Cases Links:
SFDC Cases Counter:

Implement a pre-puller logic (https://coreos.slack.com/archives/CDCP2LA9L/p1623683204281100?thread_ts=1623666860.270300&cid=CDCP2LA9L) for reducing downtime during ovn-k8s upgades.

The CNI 2.0 fix for a prober that can identify when a node is ready or not which was the preferred way to solve https://bugzilla.redhat.com/show_bug.cgi?id=1943334 and https://bugzilla.redhat.com/show_bug.cgi?id=1943336 might not happen anytime soon or might be a bit tricky.

So the way we see now, we could do a combo of things:

1) Do the pre-puller and reduce the downtime from 60sec further.

2) Do the tainting as well, - even if its not efficient and doesn't cover all use cases like node restarts, at least from CI perspective it will solve the bugs.

3) We can utilize the `Status()` probes of the CRIO/CNI and leverage when the plugin deletes the config versus sets it up. Problem here is to make multus reflect the changes done by SDN/OVN plugins.

links to

openshift/cluster-network-operator#1141: [WIP] SDN-1955: Add pre-puller ds to reduce upgrade downtime

Assignee:: Surya Seetharaman

Reporter:: Surya Seetharaman

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2021/06/17 12:19 PM

Updated:: 2022/03/15 1:23 PM

Resolved:: 2021/07/23 10:54 AM

Details

Description

Attachments

Issue Links

Activity

People

Dates