Uploaded image for project: 'Red Hat Data Grid'
  1. Red Hat Data Grid
  2. JDG-3991

OpenShift: Server killed by OOM restarts forever after

XMLWordPrintable

    • Impediment
    • Hide

      Shutdown whole cluster and bring it back.

      Show
      Shutdown whole cluster and bring it back.
    • Hide
      • Deploy a default cluster of two nodes
      • Start filling default cache till eviction starts removing incoming entries
      • Keep sending entries, one of the servers gets eventually killed while memory usage is close to a maximum (512MB in this case)
      Show
      Deploy a default cluster of two nodes Start filling default cache till eviction starts removing incoming entries Keep sending entries, one of the servers gets eventually killed while memory usage is close to a maximum (512MB in this case)
    • DataGrid Sprint #56, DataGrid Sprint #61, DataGrid Sprint #62

      When user fully fills default cache, even though eviction is in place, server gets eventually killed while under the load. Server killed that way isn't able to start properly after that due to:

      FATAL (main) [org.infinispan.SERVER] ISPN080028: Red Hat Data Grid Server failed to start java.util.concurrent.ExecutionException: org.infinispan.manager.EmbeddedCacheManagerStartupException: org.infinispan.commons.CacheException: Initial state transfer timed out for cache org.infinispan.LOCKS on example-infinispan-0-61960
      

      Full failing server log in the attachment.

        1. example-infinispan-0-after-failure.log
          1.07 MB
          Pavel Drobek
        2. server.log
          8 kB
          Pavel Drobek

              remerson@redhat.com Ryan Emerson
              pdrobek@redhat.com Pavel Drobek (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              12 Start watching this issue

                Created:
                Updated:
                Resolved: