  1. Product Technical Learning
  2. PTL-8586

DO280-398: Lockups due to OOM


    • Type: Bug
    • Resolution: Done
    • Priority: Critical
    • DO280 - OCP 3.9 1 20180828
    • DO280
    • None
    • 32 GB, Core i7, SSD laptops. Foundation 7.5

    • ILT
    • en-US (English)

      Reporter RHNID:
      Section: -
      Language: en-US (English)
      Workaround: Make sure students delete old projects (this helps but does not solve the problem). When the lockup occurs, run systemctl restart atomic-openshift-* on master.

      Description: This was my first time running DO280 v3.9, on the 32 GB, Core i7, SSD laptops.

      Several students and I have experienced long delays when running commands such as oc get pods and ssh root@master. The commands eventually respond, but the wait ranges from several seconds to over a minute.
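To put numbers on the delays described above, a small timing helper could be run from the workstation. This is only a sketch: the function names `time_command` and `is_slow`, the 120-second timeout, and the 5-second "slow" threshold are illustrative assumptions, not part of the course materials or this report.

```python
import subprocess
import sys
import time


def time_command(argv, timeout=120.0):
    """Run argv once; return (elapsed_seconds, return_code).

    return_code is None if the command did not finish within `timeout`.
    The 120 s default is an assumption chosen to cover the
    "over a minute" waits mentioned in the report.
    """
    start = time.monotonic()
    try:
        result = subprocess.run(
            argv,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
        code = result.returncode
    except subprocess.TimeoutExpired:
        code = None
    return time.monotonic() - start, code


def is_slow(elapsed, threshold=5.0):
    """Flag a run as slow; the 5 s threshold is a hypothetical cutoff."""
    return elapsed >= threshold
```

For example, `time_command(["oc", "get", "pods"])` or `time_command(["ssh", "root@master", "true"])` would return the wall-clock time of one invocation, which can be logged alongside the time of day to correlate with load on master.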

      While investigating, I noticed intermittent high CPU usage on our foundation systems and eventually narrowed it down to the master VM consuming the most CPU cycles. I also noticed low available memory. In top, the culprit appears to be kswapd0.
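The low-memory observation could be checked on master by reading /proc/meminfo rather than watching top. The following is a minimal sketch, assuming the kernel reports a `MemAvailable` field; the helper names and the 10% threshold are illustrative assumptions, not values taken from this report.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field_name: integer_value}.

    Most values are in kB; a few fields (e.g. HugePages_Total) are counts.
    """
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines without a "Name: value" shape
        fields = rest.split()
        if fields and fields[0].isdigit():
            values[key.strip()] = int(fields[0])
    return values


def available_fraction(values):
    """Return MemAvailable as a fraction of MemTotal."""
    return values["MemAvailable"] / values["MemTotal"]


def under_memory_pressure(values, min_fraction=0.10):
    """True when available memory falls below min_fraction of the total.

    The 10% default is a hypothetical cutoff for illustration only.
    """
    return available_fraction(values) < min_fraction
```

On master, the text would come from `open("/proc/meminfo").read()`. A persistently small available fraction together with a busy kswapd0 would be consistent with the memory shortage suspected here.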

      I believe this can be resolved by allocating more memory to master. We have the memory available, since DO280 is now a Level IV class.

            jimrigsbee_jira Jim Rigsbee (Inactive)
            dlewis7444 David Lewis (Inactive)
