Uploaded image for project: 'Machine Config Operator'
  1. Machine Config Operator
  2. MCO-1635

Add runbook for HighOverallControlPlaneMemory alert

XMLWordPrintable

    • Icon: Story Story
    • Resolution: Done
    • Icon: Major Major
    • None
    • None

      MCO will send an alert when a node  for 1 hour, when all control plane node have extremely high memory usage

      The alerts describes the following 

                  summary: >-
                    Memory utilization across all control plane nodes is high, and could impact responsiveness and stability.
                  description: >-
                    Given three control plane nodes, the overall memory utilization may only be about 2/3 of all available capacity.
                    This is because if a single control plane node fails, the kube-apiserver and etcd may be slow to respond.
                    To fix this, increase memory of the control plane nodes.

      It is possible that admin may not be able to interpret exact action to be taken after looking at the alert and the cluster state. Adding runbook (https://github.com/openshift/runbooks) can help admin in better troubleshooting and taking appropriate action.

       

      Acceptance Criteria:

      • Runbook doc is created for HighOverallControlPlaneMemory alert
      • Created runbook link is accessible to cluster admin with HighOverallControlPlaneMemory  alert

       

              rhn-support-cruhm Courtney Ruhm
              rhn-support-cruhm Courtney Ruhm
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                Resolved: