Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-22095

PerformanceProfile render fails at Day-0 because the master/worker pools are not yet present

XMLWordPrintable

    • Moderate
    • No
    • CNF Compute Sprint 248, CNF Compute Sprint 249, CNF Compute Sprint 250, CNF Compute Sprint 251, CNF Compute Sprint 252, CNF Compute Sprint 253
    • 6
    • False
    • Hide

      None

      Show
      None
    • 2024-04-10: Needs a retest due to the last comment. The issue here is likely fixed already.

      Description of problem:

      The installer supports pre-rendering of the PerformanceProfile related manifests. However the MCO render is executed after the PerfProfile render and so the master and worker MachineConfigPools are created too late.
      
      This causes the installation process to fail with:
      
      Oct 18 18:05:25 localhost.localdomain bootkube.sh[537963]: I1018 18:05:25.968719       1 render.go:73] Rendering files into: /assets/node-tuning-bootstrap
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.008421       1 render.go:133] skipping "/assets/manifests/99_feature-gate.yaml" [1] manifest because of unhandled *v1.FeatureGate
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.013043       1 render.go:133] skipping "/assets/manifests/cluster-dns-02-config.yml" [1] manifest because of unhandled *v1.DNS
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.021978       1 render.go:133] skipping "/assets/manifests/cluster-ingress-02-config.yml" [1] manifest because of unhandled *v1.Ingress
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.023016       1 render.go:133] skipping "/assets/manifests/cluster-network-02-config.yml" [1] manifest because of unhandled *v1.Network
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.023160       1 render.go:133] skipping "/assets/manifests/cluster-proxy-01-config.yaml" [1] manifest because of unhandled *v1.Proxy
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.023445       1 render.go:133] skipping "/assets/manifests/cluster-scheduler-02-config.yml" [1] manifest because of unhandled *v1.Scheduler
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: I1018 18:05:26.024475       1 render.go:133] skipping "/assets/manifests/cvo-overrides.yaml" [1] manifest because of unhandled *v1.ClusterVersion
      Oct 18 18:05:26 localhost.localdomain bootkube.sh[537963]: F1018 18:05:26.037467       1 cmd.go:53] no MCP found that matches performance profile node selector "node-role.kubernetes.io/master="

      Version-Release number of selected component (if applicable):

      4.14.0-rc.6

      How reproducible:

      Always

      Steps to Reproduce:

      1. Add an SNO PerformanceProfile to extra manifest in the installer. Node selector should be: "node-role.kubernetes.io/master="
      2.
      3.
      

      Actual results:

      no MCP found that matches performance profile node selector "node-role.kubernetes.io/master="

      Expected results:

      Installation completes

      Additional info:

      apiVersion: performance.openshift.io/v2
      kind: PerformanceProfile
      metadata:
       name: openshift-node-workload-partitioning-sno
      spec:
       cpu:
         isolated: 4-X <- must match the topology of the node
         reserved: 0-3
       nodeSelector:
         node-role.kubernetes.io/master: ""
      

            msivak@redhat.com Martin Sivak
            msivak@redhat.com Martin Sivak
            Dwaine Gonyier Dwaine Gonyier
            Egli Hila
            Votes:
            0 Vote for this issue
            Watchers:
            12 Start watching this issue

              Created:
              Updated: