Data Foundation Bugs / DFBUGS-767

[2276828] CephMonQuorumAtRisk and CephMonQuorumLost documentation needs an update for five-mon configurations

    • Type: Bug
    • Resolution: Done
    • Priority: Critical
    • odf-4.16
    • odf-4.15

      Description of problem (please be as detailed as possible and provide log
      snippets):
      https://github.com/openshift/runbooks/blob/master/alerts/openshift-container-storage-operator/CephMonQuorumLost.md and https://github.com/openshift/runbooks/blob/master/alerts/openshift-container-storage-operator/CephMonQuorumAtRisk.md need to be updated to cover the five-mon configuration.

      Version of all relevant components (if applicable):
      OCP 4.15 and ODF 4.15.2-1

      Does this issue impact your ability to continue to work with the product
      (please explain in detail what is the user impact)?
      No

      Is there any workaround available to the best of your knowledge?
      NA

      Rate from 1 - 5 the complexity of the scenario you performed that caused this
      bug (1 - very simple, 5 - very complex)?
      1

      Is this issue reproducible?
      Yes

      Can this issue be reproduced from the UI?
      Yes

      If this is a regression, please provide more details to justify this:
      NA

      Steps to Reproduce:
      1. Install OCP 4.15 and ODF 4.15.2-1 on a cluster with six failure zones.
      2. Drain three of the nodes so that the CephMonQuorumAtRisk alert is shown.
      3. Open the documentation for the alert. It describes only three-mon clusters; the page should also cover the five-mon configuration.

      Actual results:
      CephMonQuorumAtRisk
      Meaning
      Storage cluster quorum is low. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.

      CephMonQuorumLost
      Meaning
      The number of monitors in the storage cluster are not enough. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.

      This alert indicates that there is only 1 monitor pod running or even none

      Expected results:

      CephMonQuorumLost
      Meaning
      The number of monitors in the storage cluster are not enough. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 or 5 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.

      This alert indicates that there is only 1 monitor pod running or even none

      CephMonQuorumAtRisk
      Meaning
      Storage cluster quorum is low. Multiple mons work together to provide redundancy by each keeping a copy of the metadata. Cluster is deployed with 3 or 5 mons, and require 2 or more mons to be up and running for quorum and for the storage operations to run.
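
      Note on the quorum arithmetic: Ceph monitors form quorum by strict majority, so the minimum number of running mons depends on how many are deployed. The sketch below is illustrative only; the function names are hypothetical and are not part of Ceph, Rook, or ODF. Under the majority rule, a 5-mon cluster would need 3 mons up rather than 2, which may be worth reflecting in the updated runbook wording.

```python
def min_mons_for_quorum(total_mons: int) -> int:
    """Smallest number of mons that forms a strict majority of total_mons."""
    if total_mons < 1:
        raise ValueError("total_mons must be at least 1")
    return total_mons // 2 + 1


def tolerated_mon_failures(total_mons: int) -> int:
    """How many mons can be down while a majority is still up."""
    return total_mons - min_mons_for_quorum(total_mons)


def has_quorum(total_mons: int, mons_up: int) -> bool:
    """True if the mons that are up form a strict majority."""
    return mons_up >= min_mons_for_quorum(total_mons)


if __name__ == "__main__":
    # Compare the two deployment sizes discussed in this report.
    for total in (3, 5):
        print(
            f"{total} mons: need {min_mons_for_quorum(total)} up, "
            f"tolerate {tolerated_mon_failures(total)} down"
        )
```

      For example, has_quorum(5, 2) evaluates to False, while has_quorum(3, 2) evaluates to True.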

      Additional info:

              nladha Nikhil Ladha
              rhn-support-jopinto Joy Pinto
              Joy Pinto Joy Pinto