Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-53525

Instability and System Freezes/Crash in KVM Virtualized Environments

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • None
    • None
    • None
    • No
    • Important
    • CustomerScenariosInitiative
    • rhel-sst-virtualization
    • ssg_virtualization
    • 800
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None

      What were you trying to do that didn't work?

      During the upgrade of OpenShift from version 4.14.31 to 4.15.22, we have experienced intermittent freezes and crashes across both Windows and Linux Fedora virtual machines.

      One particular Fedora VM freezes and displays the message "marking tsc unstable due to clocksource watchdog" in the console. This issue has also been observed on other Fedora VMs that did not freeze. Additionally, some Windows VMs have experienced Event ID 41 reboots.

      I have come across a related bug report that appears similar to our situation, though it involves AMD cores my servers use Intel : 

      https://bugzilla.redhat.com/show_bug.cgi?id=2125671

      Please provide the package NVR for which bug is seen:

      sh-5.1# lscpu | grep -i 'vendor id'
      Vendor ID:                          GenuineIntel
      BIOS Vendor ID:                     Intel
      sh-5.1# uname -r
      5.14.0-284.75.1.el9_2.x86_64

      sh-5.1# cat /etc/redhat-release 
      Red Hat Enterprise Linux CoreOS release 4.15

      How reproducible:

      sometimes 

      Steps to reproduce

      1. Create an Open Shift system version 4.14.31 with 56 windows and 10 fedoras
      2. Upgrade to 4.15.22

      Expected results

      VMS do not crash or freeze

      Actual results

      VMS crash and freeze

        1. image (15).png
          image (15).png
          48 kB
        2. image-2024-08-15-14-43-23-147.png
          image-2024-08-15-14-43-23-147.png
          16 kB
        3. image-2024-08-15-14-45-55-177.png
          image-2024-08-15-14-45-55-177.png
          16 kB
        4. image-2024-08-18-16-39-59-304.png
          image-2024-08-18-16-39-59-304.png
          103 kB
        5. image-2024-08-20-19-04-15-755.png
          image-2024-08-20-19-04-15-755.png
          16 kB
        6. image-2024-08-22-12-49-31-999.png
          image-2024-08-22-12-49-31-999.png
          50 kB
        7. kubevirt-evacuation-5c4sj.yml
          2 kB
        8. kubevirt-evacuation-hq9lj.yml
          2 kB
        9. kubevirt-evacuation-ld5tx.yml
          2 kB
        10. kubevirt-workload-update-mqwvm.yml
          2 kB
        11. kubevirt-workload-update-pnlvf.yml
          2 kB
        12. logs-virt-launcher-fedora-8-rs9p8.log
          127 kB
        13. must-gather.local.8948285524674220215.tar.gz
          94.23 MB
        14. Screenshot from 2024-08-08 12-13-23.png
          Screenshot from 2024-08-08 12-13-23.png
          10 kB
        15. virt-launcher-fedora-8-rs9p8.yml
          9 kB

              virt-maint virt-maint
              guchen11 Guy Chen
              virt-maint virt-maint
              Xiaohui Li Xiaohui Li
              Votes:
              1 Vote for this issue
              Watchers:
              21 Start watching this issue

                Created:
                Updated: