[OCPBUGS-8703] fails to switch to kernel-rt with RHEL 9.2


      This breaks switching to RT kernel and back to default kernel on RHCOS 9 nodes

      3/7: Green as a fix is posted; waiting on merge before backporting to 4.13.
      3/3: telco reviewed

      This came up a while ago, see https://groups.google.com/u/1/a/redhat.com/g/aos-devel/c/HuOTwtI4a9I/m/nX9mKjeqAAAJ

      Basically this MC:

      apiVersion: machineconfiguration.openshift.io/v1
      kind: MachineConfig
      metadata:
        labels:
          machineconfiguration.openshift.io/role: worker
        name: worker-override
      spec:
        kernelType: realtime
        osImageURL: quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:b4cc3995d5fc11e3b22140d8f2f91f78834e86a210325cbf0525a62725f8e099

      will degrade the node with:

      E0301 21:25:09.234001    3306 writer.go:200] Marking Degraded due to: error running rpm-ostree override remove kernel kernel-core kernel-modules kernel-modules-extra --install kernel-rt-core --install kernel-rt-modules --install kernel-rt-modules-extra --install kernel-rt-kvm: error: Could not depsolve transaction; 1 problem detected:
       Problem: package kernel-modules-core-5.14.0-282.el9.x86_64 requires kernel-uname-r = 5.14.0-282.el9.x86_64, but none of the providers can be installed
        - conflicting requests
      : exit status 1
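
      To make the failure easier to read, here is a minimal sketch that pulls the offending package and its unmet requirement out of a depsolve error like the one above. The helper name `parse_depsolve_problem` and the regular expression are illustrative assumptions, not part of rpm-ostree or the machine-config-operator; the pattern is only matched against the message format shown in this log.

```python
import re

# Hypothetical helper (not an rpm-ostree or MCO API): matches the
# "Problem: package X requires Y, but none of the providers can be installed"
# line as it appears in the log above.
PROBLEM_RE = re.compile(
    r"Problem: package (?P<package>\S+) requires (?P<requirement>.+?), "
    r"but none of the providers can be installed"
)


def parse_depsolve_problem(log_text):
    """Return (package, requirement) for the first Problem line, or None."""
    match = PROBLEM_RE.search(log_text)
    if match is None:
        return None
    return match.group("package"), match.group("requirement")


# The depsolve portion of the log, copied from the report.
log = (
    "error: Could not depsolve transaction; 1 problem detected:\n"
    " Problem: package kernel-modules-core-5.14.0-282.el9.x86_64 requires "
    "kernel-uname-r = 5.14.0-282.el9.x86_64, "
    "but none of the providers can be installed\n"
    "  - conflicting requests\n"
)

print(parse_depsolve_problem(log))
```

      On this log the helper reports `kernel-modules-core-5.14.0-282.el9.x86_64` as the package left behind, which is not one of the packages named in the `override remove` command.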
       

       

      The annoying part is that the set of packages to remove now depends on the OS version: the command above removes kernel, kernel-core, kernel-modules, and kernel-modules-extra, but the error shows kernel-modules-core, which is not in that list, still requiring the kernel-uname-r of the kernel being removed. A while ago I filed https://github.com/coreos/rpm-ostree/issues/2542, which would push the problem down into rpm-ostree. rpm-ostree is in a better position to deal with it, so that may be the fix, though it also moves the maintenance pain down there (we can deal with that).
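
      One way to handle the version dependence on the caller's side is to remove only those default-kernel packages that are actually installed. The sketch below is an assumption-laden illustration, not the code from the eventual fix: `build_override_args` is a hypothetical helper, the candidate list simply adds `kernel-modules-core` (the package named in the error) to the four packages from the failing command, and the installed-package set is passed in rather than queried from the host.

```python
# Candidate default-kernel packages. The first, second, third and fifth come
# from the failing command in the report; "kernel-modules-core" is taken from
# the depsolve error. Treating this as the full list is an assumption.
DEFAULT_KERNEL_CANDIDATES = [
    "kernel",
    "kernel-core",
    "kernel-modules",
    "kernel-modules-core",
    "kernel-modules-extra",
]

# Realtime packages exactly as requested in the failing command.
REALTIME_KERNEL = [
    "kernel-rt-core",
    "kernel-rt-modules",
    "kernel-rt-modules-extra",
    "kernel-rt-kvm",
]


def build_override_args(installed):
    """Build rpm-ostree arguments that remove only installed default-kernel
    packages and install the realtime kernel packages.

    `installed` is a set of installed package names (hypothetical input;
    how it is obtained is left out of this sketch).
    """
    removals = [pkg for pkg in DEFAULT_KERNEL_CANDIDATES if pkg in installed]
    args = ["override", "remove", *removals]
    for pkg in REALTIME_KERNEL:
        args += ["--install", pkg]
    return args


# A host where the modules are split, as the error suggests for this node.
split_host = {
    "kernel", "kernel-core", "kernel-modules",
    "kernel-modules-core", "kernel-modules-extra",
}
print("rpm-ostree " + " ".join(build_override_args(split_host)))
```

      On a host without `kernel-modules-core` the same function yields the original command unchanged, so the removal list no longer has to be hard-coded per OS version.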

       

      We may also need to explicitly request installation of `kernel-rt-modules-core`; I'll look.


            OpenShift Jira Automation Bot added a comment - Per the announcement sent regarding the removal of "Blocker" as an option in the Priority field, this issue (which was already closed at the time of the bulk update) had Priority = "Blocker." It is being updated to Priority = Critical. No additional fields were changed.

            Errata Tool added a comment - Since the problem described in this issue should be resolved in a recent advisory, it has been closed. For information on the advisory (Important: OpenShift Container Platform 4.13.0 security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2023:1326

            Rio Liu added a comment - pre-merge tested https://github.com/openshift/machine-config-operator/pull/3604#issuecomment-1465488951

            Sinny Kumari added a comment - Moving this to POST as PR https://github.com/openshift/machine-config-operator/pull/3604 is in place.

            Rio Liu added a comment - pre-merge testing failed https://github.com/openshift/machine-config-operator/pull/3604#issuecomment-1463212353

              walters@redhat.com Colin Walters
              walters@redhat.com Colin Walters
              Rio Liu Rio Liu