  1. Red Hat Workload Availability
  2. RHWA-16

NHC: Explore Scaling Improvements by introducing Storm Recovery Threshold


    • Type: Story
    • Resolution: Unresolved
    • Priority: Undefined
    • None
    • rhwa-25.1
    • Node Healthcheck
    • None
    • False
    • False

      Promote maxUnhealthy: Investigate whether users should be encouraged to set maxUnhealthy to a fixed value (e.g., maxUnhealthy=5) as a more scalable alternative to minHealthy.
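
      To illustrate why a fixed maxUnhealthy may scale better, the sketch below compares how many unhealthy nodes each setting tolerates as the cluster grows. It assumes minHealthy is given as a percentage and that the required healthy count is rounded up; the function names and the 51% figure are illustrative, not taken from the NHC code.

```python
def allowed_unhealthy_with_min_healthy(total_nodes: int, min_healthy_percent: int) -> int:
    """Unhealthy nodes tolerated when minHealthy is a percentage of the cluster."""
    # Assumption: the required healthy count is rounded up (integer ceiling).
    required_healthy = (total_nodes * min_healthy_percent + 99) // 100
    return total_nodes - required_healthy


def allowed_unhealthy_with_max_unhealthy(total_nodes: int, max_unhealthy: int) -> int:
    """Unhealthy nodes tolerated when maxUnhealthy is a fixed count."""
    return min(total_nodes, max_unhealthy)


if __name__ == "__main__":
    for total in (10, 100, 1000):
        print(
            total,
            allowed_unhealthy_with_min_healthy(total, 51),
            allowed_unhealthy_with_max_unhealthy(total, 5),
        )
```

      Under these assumptions, the percentage-based limit grows with the cluster (4, 49, and 490 nodes for clusters of 10, 100, and 1000), while the fixed limit stays at 5.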

      Implement stormRecoveryThreshold: Introduce a new parameter, tentatively named stormRecoveryThreshold. Once the number of healthy nodes drops below minHealthy, fencing should be delayed until the number of unhealthy nodes reaches stormRecoveryThreshold. This would prevent premature fencing during transient issues and potential "storm" scenarios.
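
      The decision described above could be sketched roughly as follows. This is a literal reading of the proposal, not the NHC implementation: the function name and integer parameters are hypothetical, and the ticket does not say whether "reaches" means the unhealthy count rising to the threshold or falling back to it. The sketch assumes the former.

```python
def fencing_allowed(
    total_nodes: int,
    unhealthy_nodes: int,
    min_healthy: int,
    storm_recovery_threshold: int,
) -> bool:
    """Return True if fencing may proceed under a literal reading of the proposal."""
    healthy_nodes = total_nodes - unhealthy_nodes

    # Enough healthy nodes: minHealthy is satisfied, so nothing is delayed.
    if healthy_nodes >= min_healthy:
        return True

    # Below minHealthy: delay fencing until the unhealthy count reaches
    # the threshold (assumption: "reaches" is read as >=).
    return unhealthy_nodes >= storm_recovery_threshold
```

      With total_nodes=10, min_healthy=6, and storm_recovery_threshold=7, fencing proceeds at 2 unhealthy nodes, is delayed at 5, and proceeds again at 7. If the intended meaning is the opposite direction, only the final comparison changes.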

              mshitrit@redhat.com Michael Shitrit