Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-2275

managedcluster-import-controller-v2 OOM at scale ~2400 managedclusters

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • ACM 2.7.0
    • Cluster Lifecycle
    • None
    • False
    • Hide

      None

      Show
      None
    • False
    • No

      Description of problem:

      While deploying 3000+ SNOs with ACM and ZTP, the managedcluster-import-controller-v2 eventually OOMs and prevents new installed clusters from becoming managed.

      Test:

      • Attempted to install 3591 SNOs
      • Successfully installed 3114
      • Only Managed 2419

      Version-Release number of selected component (if applicable):

      2.7.0-DOWNSTREAM-2022-11-25-10-53-02
      OCP 4.11.13 (Hub and managedclusters)

      How reproducible:

      Steps to Reproduce:

      1.  
      2.  
      3. ...

      Actual results:

      Expected results:

      Additional info:

      The container has a 2Gi memory limit, I am bumping it to 16Gi in this test evironment to see if the remaining clusters become managed. 

      # oc get po -n multicluster-engine                                                                                                                  
      NAME                                                   READY   STATUS             RESTARTS          AGE                                                                                      
      managedcluster-import-controller-v2-66896db6b4-vgrbh   0/1     CrashLoopBackOff   244 (75s ago)     28h                                                                           
      managedcluster-import-controller-v2-66896db6b4-zkfs8   0/1     CrashLoopBackOff   244 (114s ago)    28h
      

              wliu1 Wei Liu
              akrzos@redhat.com Alex Krzos
              Alex Krzos Alex Krzos
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: