Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Critical
Fix Version/s: 1.3.16.Final
Affects Version/s: 1.3.13.Final
Component/s: mod_proxy_cluster (native httpd modules)
Labels:
None

Steps to Reproduce:
Hide

An easy way to recreate such a hung JVM that is never removed is suspending it like so:

kill -STOP $PID
Show
An easy way to recreate such a hung JVM that is never removed is suspending it like so: kill -STOP $PID
Release Note Text:
Undefined
Git Pull Request:
https://github.com/modcluster/mod_cluster/pull/512

If a backend JVM is entirely hung (socket still listening, but no requests ever processed, no STATUS MCMPs ever sent), then mod_cluster does not handle it well currently as traffic is never routed off the bad instance and the bad instance is never removed from the balancer.

In such a state, requests always persistently timeout, but this doesn't put the balancer member in an error state so requests continue to it. Periodic pings may be attempted and will fail, but that does not stop requests to the problem instance. After 60 ping failures, the node could be removed, but the logic here is problematic as any attempted request (which still times out) results in the failure count being reset:

            if (elected == oldelected) {
...
            } else
                ou->mess.num_failure_idle = 0;

So at least any continually failing request attempts should not result in the ping failure count being reset and preventing the node removal. We may also consider preventing any requests to a JVM if its pings are currently failing.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

patch.MODCLUSTER-732
2021/04/19 3:23 PM
5 kB
Jean-Frederic Clere

clones

JBCS-1100 mod_cluster never removes hung JVM that has requests routed to it

Closed

Assignee:: Jean-Frederic Clere

Reporter:: Aaron Ogburn

Tester:: Paul Lodge

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2021/04/19 2:42 PM

Updated:: 2022/08/19 9:18 AM

Resolved:: 2022/08/19 9:18 AM

Details

Description

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates