Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-49324

Tuned Profile cannot be found for none ARM vendor id's

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Undefined Undefined
    • None
    • 4.19
    • Node Tuning Operator
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • No
    • arm
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      The issue is that if the vendor_id of an arm machine is not "arm", the profile created by the node tuning operator could not be found

      Version-Release number of selected component (if applicable):

      4.19

      How reproducible:

      Always

      Steps to Reproduce:

      1. Apply a performanceprofile to the cluster
      2. Checking to see the Vendor ID in the respective node using lscpu
      sh-5.1# lscpu
      Architecture:             aarch64
        CPU op-mode(s):         32-bit, 64-bit
        Byte Order:             Little Endian
      CPU(s):                   8
        On-line CPU(s) list:    0-7
      Vendor ID:                APM
        BIOS Vendor ID:         QEMU
        BIOS Model name:        virt-8.2
        Model:                  2
        Thread(s) per core:     1
        Core(s) per socket:     1
        Socket(s):              8
        Stepping:               0x3
        BogoMIPS:               80.00
        Flags:                  fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid
      NUMA:
        NUMA node(s):           1
        NUMA node0 CPU(s):      0-7
      Vulnerabilities:
        Gather data sampling:   Not affected
        Itlb multihit:          Not affected
        L1tf:                   Not affected
        Mds:                    Not affected
        Meltdown:               Mitigation; PTI
        Mmio stale data:        Not affected
        Reg file data sampling: Not affected
        Retbleed:               Not affected
        Spec rstack overflow:   Not affected
        Spec store bypass:      Vulnerable
        Spectre v1:             Mitigation; __user pointer sanitization
        Spectre v2:             Vulnerable
        Srbds:                  Not affected
        Tsx async abort:        Not affected
       

      We can see here that the Vendor ID is APM and not ARM.

      Logging out the respective tuned pod  

      [root@ampere-hr350a-05 cluster-node-tuning-operator]# oc get clusterversion
      NAME      VERSION                                    AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.19.0-0.nightly-arm64-2025-01-21-060530   True        False         2d      Cluster version is 4.19.0-0.nightly-arm64-2025-01-21-060530
      [root@ampere-hr350a-05 cluster-node-tuning-operator]# oc get no,mcp
      NAME                                                  STATUS   ROLES                  AGE    VERSION
      node/ocp-ctlplane-0.libvirt.lab.eng.tlv2.redhat.com   Ready    control-plane,master   2d1h   v1.31.3
      node/ocp-ctlplane-1.libvirt.lab.eng.tlv2.redhat.com   Ready    control-plane,master   2d1h   v1.31.3
      node/ocp-ctlplane-2.libvirt.lab.eng.tlv2.redhat.com   Ready    control-plane,master   2d1h   v1.31.3
      node/ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com     Ready    worker,worker-cnf      2d1h   v1.31.3
      node/ocp-worker-1.libvirt.lab.eng.tlv2.redhat.com     Ready    worker                 2d1h   v1.31.3
      node/ocp-worker-2.libvirt.lab.eng.tlv2.redhat.com     Ready    worker                 2d1h   v1.31.3
      
      NAME                                                             CONFIG                                                 UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
      machineconfigpool.machineconfiguration.openshift.io/master       rendered-master-90d84ac134a787a72303253e739a840f       True      False      False      3              3                   3                     0                      2d1h
      machineconfigpool.machineconfiguration.openshift.io/worker       rendered-worker-75cc5a0c3018b679c87225f781ef77cc       True      False      False      2              2                   2                     0                      2d1h
      machineconfigpool.machineconfiguration.openshift.io/worker-cnf   rendered-worker-cnf-afa00eb9e54cece215716a50336938a9   True      False      False      1              1                   1                     0                      46h
      [root@ampere-hr350a-05 cluster-node-tuning-operator]# oc get profiles -A
      NAMESPACE                                NAME                                             TUNED                                    APPLIED   DEGRADED   MESSAGE                                                            AGE
      openshift-cluster-node-tuning-operator   ocp-ctlplane-0.libvirt.lab.eng.tlv2.redhat.com   openshift-control-plane                  True      False      TuneD profile applied.                                             2d1h
      openshift-cluster-node-tuning-operator   ocp-ctlplane-1.libvirt.lab.eng.tlv2.redhat.com   openshift-control-plane                  True      False      TuneD profile applied.                                             2d1h
      openshift-cluster-node-tuning-operator   ocp-ctlplane-2.libvirt.lab.eng.tlv2.redhat.com   openshift-control-plane                  True      False      TuneD profile applied.                                             2d1h
      openshift-cluster-node-tuning-operator   ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com     openshift-node-performance-performance   False     True       The TuneD daemon profile not yet applied, or application failed.   2d1h
      openshift-cluster-node-tuning-operator   ocp-worker-1.libvirt.lab.eng.tlv2.redhat.com     openshift-node                           True      False      TuneD profile applied.                                             2d1h
      openshift-cluster-node-tuning-operator   ocp-worker-2.libvirt.lab.eng.tlv2.redhat.com     openshift-node                           True      False      TuneD profile applied.                                             2d1h
      [root@ampere-hr350a-05 cluster-node-tuning-operator]# oc get pods -o wide
      NAME                                            READY   STATUS    RESTARTS   AGE    IP                NODE                                             NOMINATED NODE   READINESS GATES
      cluster-node-tuning-operator-5dd57557d6-hpbfj   1/1     Running   0          2d     10.133.0.48       ocp-ctlplane-1.libvirt.lab.eng.tlv2.redhat.com   <none>           <none>
      tuned-29hzp                                     1/1     Running   1          2d1h   192.168.122.143   ocp-ctlplane-1.libvirt.lab.eng.tlv2.redhat.com   <none>           <none>
      tuned-42fhx                                     1/1     Running   1          2d1h   192.168.122.240   ocp-worker-2.libvirt.lab.eng.tlv2.redhat.com     <none>           <none>
      tuned-4tjvv                                     1/1     Running   1          2d1h   192.168.122.215   ocp-ctlplane-2.libvirt.lab.eng.tlv2.redhat.com   <none>           <none>
      tuned-b6qft                                     1/1     Running   1          2d1h   192.168.122.241   ocp-worker-1.libvirt.lab.eng.tlv2.redhat.com     <none>           <none>
      tuned-jv5rb                                     1/1     Running   1          2d1h   192.168.122.100   ocp-ctlplane-0.libvirt.lab.eng.tlv2.redhat.com   <none>           <none>
      tuned-p5qsm                                     1/1     Running   3          2d1h   192.168.122.51    ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com     <none>           <none>
      tuned-p5qsm-debug-t67mv                         1/1     Running   0          13m    192.168.122.51    ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com     <none>           <none>
      [root@ampere-hr350a-05 cluster-node-tuning-operator]# oc logs pod/tuned-p5qsm
      I0123 12:24:58.271512    3397 controller.go:1666] starting in-cluster ocp-tuned v4.19.0-202501160507.p0.geb1cee9.assembly.stream.el9-0-g0481a45-dirty
      I0123 12:24:58.303092    3397 controller.go:671] writing /var/lib/ocp-tuned/image.env
      I0123 12:24:58.317573    3397 controller.go:702] tunedRecommendFileRead(): read "openshift-node-performance-performance" from "/etc/tuned/recommend.d/50-openshift.conf"
      I0123 12:24:58.318981    3397 controller.go:1728] starting: profile unpacked is "openshift-node-performance-performance" fingerprint "8030f1e6761e591d20cf502cb2bd857f0f84ef0c6a2d21bf2416c7a896765a52"
      I0123 12:24:58.319053    3397 controller.go:1424] recover: no pending deferred change
      I0123 12:24:58.319069    3397 controller.go:1734] starting: no pending deferred update
      I0123 12:25:13.954962    3397 controller.go:382] disabling system tuned...
      I0123 12:25:14.020991    3397 controller.go:1546] started events processors
      I0123 12:25:14.021597    3397 controller.go:1567] monitoring filesystem events on "/etc/tuned/bootcmdline"
      I0123 12:25:14.021630    3397 controller.go:1570] started controller
      I0123 12:25:14.045579    3397 controller.go:359] set log level 0
      I0123 12:25:14.046458    3397 controller.go:692] tunedRecommendFileWrite(): written "/etc/tuned/recommend.d/50-openshift.conf" to set TuneD profile openshift-node-performance-performance
      I0123 12:25:14.046551    3397 controller.go:417] profilesExtract(): extracting 6 TuneD profiles (recommended=openshift-node-performance-performance)
      I0123 12:25:14.116439    3397 controller.go:462] profilesExtract(): recommended TuneD profile openshift-node-performance-performance content unchanged [openshift]
      I0123 12:25:14.117777    3397 controller.go:462] profilesExtract(): recommended TuneD profile openshift-node-performance-performance content unchanged [openshift-node-performance-performance]
      I0123 12:25:14.118359    3397 controller.go:478] profilesExtract(): fingerprint of extracted profiles: "8030f1e6761e591d20cf502cb2bd857f0f84ef0c6a2d21bf2416c7a896765a52"
      I0123 12:25:14.118553    3397 controller.go:818] tunedReload()
      I0123 12:25:14.118704    3397 controller.go:745] starting tuned...
      I0123 12:25:14.118748    3397 run.go:121] running cmd...
      2025-01-23 12:25:14,338 INFO     tuned.daemon.application: TuneD: 2.24.0, kernel: 5.14.0-552.el9.aarch64+64k
      2025-01-23 12:25:14,339 INFO     tuned.daemon.application: dynamic tuning is globally disabled
      2025-01-23 12:25:14,344 INFO     tuned.daemon.daemon: using sleep interval of 1 second(s)
      2025-01-23 12:25:14,344 INFO     tuned.daemon.daemon: Running in automatic mode, checking what profile is recommended for your configuration.
      2025-01-23 12:25:14,345 INFO     tuned.daemon.daemon: Using 'openshift-node-performance-performance' profile
      2025-01-23 12:25:14,348 INFO     tuned.profiles.loader: loading profile: openshift-node-performance-performance
      2025-01-23 12:25:14,448 ERROR    tuned.daemon.daemon: Cannot set initial profile. No tunings will be enabled: Cannot load profile(s) 'openshift-node-performance-performance': Cannot find profile 'openshift-node--performance-aarch64-performance' in '['/var/lib/ocp-tuned/profiles', '/usr/lib/tuned', '/usr/lib/tuned/profiles']'.
      2025-01-23 12:25:14,449 INFO     tuned.daemon.controller: starting controller

      Actual results:

          

      Expected results:

          

      Additional info:

       

              msivak@redhat.com Martin Sivak
              rh-ee-rshemtov Roy Shemtov
              None
              None
              Liquan Cui Liquan Cui
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: