-
Bug
-
Resolution: Done
-
Critical
-
odf-4.15
-
None
Description of problem (please be detailed as possible and provide log
snippests):
https://github.com/openshift/runbooks/blob/master/alerts/openshift-container-storage-operator/CephMonQuorumLost.md and https://github.com/openshift/runbooks/blob/master/alerts/openshift-container-storage-operator/CephMonQuorumAtRisk.md needs update with respect to five mon configuration
Version of all relevant components (if applicable):
OCP 4.15 and ODF 4.15.2-1
Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
No
Is there any workaround available to the best of your knowledge?
NA
Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
1
Can this issue reproducible?
Yes
Can this issue reproduce from the UI?
Yes
If this is a regression, please provide more details to justify this:
NA
Steps to Reproduce:
1. Install OCP 4.15 and ODF 4.15.2-1 on six failure zone cluster
2. Drain three of the nodes where CephMonQuorumAtRisk error is shown
3. Open the documentation for the alert, It only describes about three mon clusters. whereas we should update the page for five mon configuration as well
Actual results:
CephMonQuorumAtRisk
Meaning
Storage cluster quorum is low. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.
CephMonQuorumLost
Meaning
The number of monitors in the storage cluster are not enough. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.
This alert indicates that there is only 1 monitor pod running or even non
Expected results:
CephMonQuorumLost
Meaning
The number of monitors in the storage cluster are not enough. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 or 5 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.
This alert indicates that there is only 1 monitor pod running or even none
CephMonQuorumAtRisk
Meaning
Storage cluster quorum is low. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 or 5 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.
Additional info:
- external trackers