Uploaded image for project: 'JBoss Enterprise Application Platform 6'
  1. JBoss Enterprise Application Platform 6
  2. JBPAPP6-833

mod_cluster: Failover on worker shutdown takes too much time

    Details

    • Type: Bug
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: EAP 6.0.0 ER 3_Beta1, EAP 6.0.0 ER 8, EAP 6.0.0
    • Fix Version/s: EAP 6.0.0
    • Component/s: None
    • Environment:

      mod_cluster 1.2.Final Windows, RHEL, Solaris

    • Affects:
      Release Notes

      Description

      This JIRA captures the fact that failover, even with shutdown (not kill) is quite slow.
      What do you think about this:

      10.16.89.39 - - [14/Mar/2012:16:12:24 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 200 2
      10.16.89.39 - - [14/Mar/2012:16:12:24 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 200 2
      10.16.88.188 - - [14/Mar/2012:16:12:27 -0400] "DISABLE-APP / HTTP/1.1" 200 -
      10.16.88.188 - - [14/Mar/2012:16:12:27 -0400] "DISABLE-APP / HTTP/1.1" 200 -
      10.16.88.188 - - [14/Mar/2012:16:12:27 -0400] "STOP-APP / HTTP/1.1" 200 74
      10.16.88.188 - - [14/Mar/2012:16:12:27 -0400] "STOP-APP / HTTP/1.1" 200 81
      10.16.88.188 - - [14/Mar/2012:16:12:27 -0400] "REMOVE-APP / HTTP/1.1" 200 -
      10.16.88.188 - - [14/Mar/2012:16:12:27 -0400] "REMOVE-APP /* HTTP/1.1" 200 -
      10.16.89.39 - - [14/Mar/2012:16:12:28 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:29 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:30 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:31 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:33 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:35 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:36 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 503 323
      10.16.89.39 - - [14/Mar/2012:16:12:39 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 200 2
      10.16.89.39 - - [14/Mar/2012:16:12:40 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 200 2
      10.16.89.39 - - [14/Mar/2012:16:12:41 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 200 2
      10.16.89.39 - - [14/Mar/2012:16:12:41 -0400] "GET /SessionTest/SessionTestServlet HTTP/1.1" 200 2
      

      There were 7 "503" HTTP errors in 15 seconds time span, despite the fact that the balancer has received the REMOVE-APP /* message... Error_log on pastebin.

      Is it ok, that there was no DISABLE-APP and STOP-APP for context /* ?
      Mod_cluster 1.1.3 with EAP5 was not presenting this behaviour

      Note: We are talking just manual testing on windows(balancer) and 2 RHEL workers here, just Ctrl+F5 in Firefox and Ctrl+C in terminal. No hundreds of thousands of requests and killing jvm with -9.

        Gliffy Diagrams

          Attachments

            Issue Links

              Activity

                People

                • Assignee:
                  pferraro Paul Ferraro
                  Reporter:
                  mbabacek Michal Karm
                • Votes:
                  0 Vote for this issue
                  Watchers:
                  5 Start watching this issue

                  Dates

                  • Created:
                    Updated:
                    Resolved: