RHEL-53525

Instability and System Freezes/Crash in KVM Virtualized Environments

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Normal
    • No
    • Important
    • CustomerScenariosInitiative
    • rhel-sst-virtualization
    • ssg_virtualization
    • 800
    • False

      What were you trying to do that didn't work?

      During the upgrade of OpenShift from 4.14.31 to 4.15.22, we experienced intermittent freezes and crashes on both Windows and Fedora Linux virtual machines.

      One particular Fedora VM freezes and displays the message "marking tsc unstable due to clocksource watchdog" on the console. The same message has also been observed on other Fedora VMs that did not freeze. Additionally, some Windows VMs rebooted and logged Event ID 41.
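
      A minimal sketch for checking, inside a Linux guest, whether the watchdog message quoted above was logged and which clocksource the kernel is currently using. The function names are hypothetical and only for illustration; the sysfs files are the standard Linux clocksource interface, and reading dmesg may require root.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the kernel log text on stdin contains
# the TSC watchdog message quoted in this report (case-insensitive).
tsc_marked_unstable() {
    grep -qi 'marking tsc unstable due to clocksource watchdog'
}

# Hypothetical helper: prints the active and available clocksources
# of the running kernel from sysfs.
show_clocksources() {
    dir=/sys/devices/system/clocksource/clocksource0
    echo "current:   $(cat "$dir/current_clocksource")"
    echo "available: $(cat "$dir/available_clocksource")"
}
```

      Possible usage inside a guest: `dmesg | tsc_marked_unstable && show_clocksources`. Running this on VMs that froze and on VMs that did not may help compare their state.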

      I came across a bug report that appears similar to our situation, although it involves AMD CPUs, whereas my servers use Intel:

      https://bugzilla.redhat.com/show_bug.cgi?id=2125671

      Please provide the package NVR for which the bug is seen:

      sh-5.1# lscpu | grep -i 'vendor id'
      Vendor ID:                          GenuineIntel
      BIOS Vendor ID:                     Intel
      sh-5.1# uname -r
      5.14.0-284.75.1.el9_2.x86_64

      sh-5.1# cat /etc/redhat-release 
      Red Hat Enterprise Linux CoreOS release 4.15

      How reproducible:

      Sometimes

      Steps to reproduce

      1. Create an OpenShift 4.14.31 cluster with 56 Windows VMs and 10 Fedora VMs.
      2. Upgrade the cluster to 4.15.22.

      Expected results

      VMs do not crash or freeze.

      Actual results

      VMs crash and freeze.

        1. image (15).png (48 kB, Guy Chen)
        2. image-2024-08-15-14-43-23-147.png (16 kB, Guy Chen)
        3. image-2024-08-15-14-45-55-177.png (16 kB, Guy Chen)
        4. image-2024-08-18-16-39-59-304.png (103 kB, Guy Chen)
        5. image-2024-08-20-19-04-15-755.png (16 kB, Guy Chen)
        6. image-2024-08-22-12-49-31-999.png (50 kB, Guy Chen)
        7. kubevirt-evacuation-5c4sj.yml (2 kB, Guy Chen)
        8. kubevirt-evacuation-hq9lj.yml (2 kB, Guy Chen)
        9. kubevirt-evacuation-ld5tx.yml (2 kB, Guy Chen)
        10. kubevirt-workload-update-mqwvm.yml (2 kB, Guy Chen)
        11. kubevirt-workload-update-pnlvf.yml (2 kB, Guy Chen)
        12. logs-virt-launcher-fedora-8-rs9p8.log (127 kB, Guy Chen)
        13. must-gather.local.8948285524674220215.tar.gz (94.23 MB, Guy Chen)
        14. Screenshot from 2024-08-08 12-13-23.png (10 kB, Guy Chen)
        15. virt-launcher-fedora-8-rs9p8.yml (9 kB, Guy Chen)

              virt-maint
              Guy Chen (guchen11)
              virt-maint
              Xiaohui Li
              Votes: 1
              Watchers: 21