Uploaded image for project: 'OpenShift Hive'
  1. OpenShift Hive
  2. HIVE-2139

Track down ClusterSync controller memory leaks

XMLWordPrintable

    • Icon: Story Story
    • Resolution: Done
    • Icon: Blocker Blocker
    • None
    • None
    • None
    • False
    • None
    • False

      So two things here:

      1. The green and aqua graphs (replicas 0 and 2) show a clear and serious problem. These replicas were handling syncsets with consistent errors. That's no excuse to leak memory like that. The vertical blue line is where we paused syncsets for the offending clusters (see OHSS-18481) and you can see the graphs level off nicely. However, the excess memory is not reclaimed. It oughtta be.
      2. The overall trend of the yellow graph (replica 1, no erroring syncsets) is still upward, albeit relatively slowly. So there's leakage even without failing syncsets.

      If code inspection doesn't reveal the problem here (I looked through and couldn't see anything obvious) then we need to profile the thing and dig deeper.

              rh-ee-mold Mark Old
              efried.openshift Eric Fried
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                Resolved: