Red Hat Workload Availability / RHWA-16

NHC: Explore Scaling Improvements by introducing Storm Recovery Mechanism

      Promote maxUnhealthy: Investigate and potentially encourage users to adopt maxUnhealthy with fixed values (e.g., maxUnhealthy=5) as a more scalable alternative to minHealthy.
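
      To illustrate why a fixed maxUnhealthy might scale better, the sketch below compares how many unhealthy nodes each setting tolerates as the cluster grows. It is only an illustration: the helper names are hypothetical, the 51% value is an example, and rounding the percentage up to a whole node is an assumption rather than confirmed NHC behavior.

```python
def tolerated_unhealthy_with_min_healthy(total_nodes: int, min_healthy_percent: int) -> int:
    """Unhealthy nodes tolerated before remediation stops, given a percentage minHealthy."""
    # Assumption: the percentage is rounded up to a whole number of nodes.
    min_healthy_nodes = (total_nodes * min_healthy_percent + 99) // 100
    return total_nodes - min_healthy_nodes


def tolerated_unhealthy_with_max_unhealthy(total_nodes: int, max_unhealthy: int) -> int:
    """Unhealthy nodes tolerated before remediation stops, given a fixed maxUnhealthy."""
    return min(total_nodes, max_unhealthy)


if __name__ == "__main__":
    for total in (10, 100, 1000):
        by_percent = tolerated_unhealthy_with_min_healthy(total, 51)
        by_fixed = tolerated_unhealthy_with_max_unhealthy(total, 5)
        print(f"nodes={total} minHealthy=51% tolerates {by_percent}, maxUnhealthy=5 tolerates {by_fixed}")
```

      Under these assumptions the percentage-based limit grows with the cluster (4, 49, 490 nodes), while the fixed limit stays at 5 regardless of size.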

      Implement Storm Recovery mechanism: Introduce a new parameter, tentatively named StormTerminationStartTime. Once the number of healthy nodes drops below minHealthy (or the number of unhealthy nodes rises above maxUnhealthy), fencing should be delayed until both of the following hold:

      • minHealthy/maxUnhealthy is satisfied again
      • StormTerminationStartTime has elapsed since minHealthy/maxUnhealthy was satisfied again
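
      The two conditions above can be sketched as a small state tracker. This is a minimal illustration, not the NHC implementation: the class and field names are hypothetical, the delay is modeled as a duration in seconds standing in for StormTerminationStartTime, and restarting the delay when the threshold is violated again during the wait is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StormTracker:
    # Hypothetical stand-in for StormTerminationStartTime, in seconds.
    storm_termination_delay: float
    storm_active: bool = False
    satisfied_since: Optional[float] = None

    def fencing_allowed(self, threshold_satisfied: bool, now: float) -> bool:
        """Return True if fencing may proceed at time `now`."""
        if not threshold_satisfied:
            # Threshold violated: enter (or stay in) storm mode and reset the delay.
            self.storm_active = True
            self.satisfied_since = None
            return False
        if not self.storm_active:
            # No storm in progress: normal operation.
            return True
        if self.satisfied_since is None:
            # Threshold just became satisfied again: start the delay.
            self.satisfied_since = now
        if now - self.satisfied_since >= self.storm_termination_delay:
            # Delay elapsed while the threshold stayed satisfied: storm is over.
            self.storm_active = False
            self.satisfied_since = None
            return True
        return False


if __name__ == "__main__":
    tracker = StormTracker(storm_termination_delay=60)
    timeline = [(0, True), (10, False), (20, True), (50, True), (55, False), (60, True), (120, True)]
    for now, satisfied in timeline:
        print(now, satisfied, tracker.fencing_allowed(satisfied, now))
```

      In this sketch fencing stays blocked from the first violation until the threshold has been continuously satisfied for the full delay.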

        1. stormCooldownDuration-4.21-snr-nhc-connected.text
          175 kB
          vipin kumar
        2. debug2 storm not active after maxunhealthy2.html
          5 kB
          Michael Habash
        3. pre kni ocp edge04 oc.html
          47 kB
          Michael Habash
        4. nhc logs.html
          155 kB
          Michael Habash
        5. stormCooldownDuration-4.18-snr-nhc-connected.text
          142 kB
          vipin kumar

              mshitrit@redhat.com Michael Shitrit