Loading...

XML

Word

Printable

Type: Bug
Resolution: Won't Do
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.7
Component/s: Networking / ovn-kubernetes
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Important
Regression:
None

Target Backport Versions:
None
Target Version:

4.13.0
Release Blocker:
Rejected
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Priority Data:
PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:
There can be scenarios that cause 1 or more of the 3 DBs to form its own raft cluster. This leads to a very broken state for OVN. Currently ovn-dbchecker does not look for mismatching cluster ids across nodes. We need at least a way to alert that this scenario has happened. Even better, once we detect this situation we need to recover the cluster. The recovery is drastic and involves:

blowing away all the db files
restarting all ovnkube master pods
issuing on all ovn-controllers (or deleting them):
ovn-appctl sb-cluster-state-reset

One possible solution is having something that identifies this scenario and then annotates all of the ovnk pods to signal what they need to do with regard to the above steps.

Assignee:: Martin Kennelly

Reporter:: Tim Rozet

Need Info From:: None

Contributors:: None

QA Contact:: Anurag Saxena

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2022/08/19 3:56 PM

Updated:: 2025/12/26 3:09 PM

Resolved:: 2022/10/21 12:38 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates