Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-36775

RHEL worker nodes no longer work due to missing MGLRU

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done-Errata
    • Icon: Major Major
    • 4.16.z
    • 4.16, 4.17
    • None
    • Important
    • Yes
    • Proposed
    • False
    • Hide

      None

      Show
      None

      This is a clone of issue OCPBUGS-36711. The following is the description of the original issue:

      Description of problem:

          With the changes in https://github.com/openshift/machine-config-operator/pull/4425, RHEL worker nodes fail as follows:
      
      [root@ptalgulk-0807c-fq97t-w-a-l-rhel-1 cloud-user]# systemctl --failed
        UNIT                  LOAD   ACTIVE SUB    DESCRIPTION                
      ● disable-mglru.service loaded failed failed Disables MGLRU on Openshfit
      
      LOAD   = Reflects whether the unit definition was properly loaded.
      ACTIVE = The high-level unit activation state, i.e. generalization of SUB.
      SUB    = The low-level unit activation state, values depend on unit type.
      
      1 loaded units listed. Pass --all to see loaded but inactive units, too.
      To show all installed unit files use 'systemctl list-unit-files'.
      [root@ptalgulk-0807c-fq97t-w-a-l-rhel-1 cloud-user]# journalctl -u disable-mglru.service
      -- Logs begin at Mon 2024-07-08 06:23:03 UTC, end at Mon 2024-07-08 08:31:35 UTC. --
      Jul 08 06:23:14 localhost.localdomain systemd[1]: Starting Disables MGLRU on Openshfit...
      Jul 08 06:23:14 localhost.localdomain bash[710]: /usr/bin/bash: /sys/kernel/mm/lru_gen/enabled: No such file or directory
      Jul 08 06:23:14 localhost.localdomain systemd[1]: disable-mglru.service: Main process exited, code=exited, status=1/FAILURE
      Jul 08 06:23:14 localhost.localdomain systemd[1]: disable-mglru.service: Failed with result 'exit-code'.
      Jul 08 06:23:14 localhost.localdomain systemd[1]: Failed to start Disables MGLRU on Openshfit.
      Jul 08 06:23:14 localhost.localdomain systemd[1]: disable-mglru.service: Consumed 4ms CPU time
      
      We should only disable mglru if it exists.

      Version-Release number of selected component (if applicable):

          4.16, 4.17

      How reproducible:

          Attempt to bring up rhel worker node

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

          

      Expected results:

          

      Additional info:

          

              team-mco Team MCO
              openshift-crt-jira-prow OpenShift Prow Bot
              Prachiti Talgulkar Prachiti Talgulkar
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated:
                Resolved: