OpenShift Bugs / OCPBUGS-48044

[Nutanix] Install multi-NICs cluster failed with failureDomains



      Description of problem:

          Installing a cluster with multiple NICs (more than one subnet) configured in failureDomains always fails with a "Duplicate value" error.
      
      # ./openshift-install create cluster --dir cluster --log-level debug
      ...
      INFO Creating infra manifests...
      INFO Created manifest *v1.Namespace, namespace= name=openshift-cluster-api-guests
      DEBUG {"level":"info","ts":"2025-01-01T11:28:56Z","msg":"Starting workers","controller":"nutanixcluster","controllerGroup":"infrastructure.cluster.x-k8s.io","controllerKind":"NutanixCluster","worker count":10}
      DEBUG {"level":"info","ts":"2025-01-01T11:28:57Z","msg":"Starting workers","controller":"nutanixmachine","controllerGroup":"infrastructure.cluster.x-k8s.io","controllerKind":"NutanixMachine","worker count":10}
      INFO Created manifest *v1beta1.Cluster, namespace=openshift-cluster-api-guests name=sgao-nutanix-zonal-l96qg
      DEBUG I0101 11:28:58.918576 2309035 recorder.go:104] "Cluster sgao-nutanix-zonal-l96qg is Provisioning" logger="events" type="Normal" object={"kind":"Cluster","namespace":"openshift-cluster-api-guests","name":"sgao-nutanix-zonal-l96qg","uid":"d86c6f80-0f60-431d-80fc-bddd7b1f2d7c","apiVersion":"cluster.x-k8s.io/v1beta1","resourceVersion":"257"} reason="Provisioning"
      DEBUG Collecting applied cluster api manifests...
      DEBUG I0101 11:28:58.924319 2309035 warning_handler.go:65] "metadata.finalizers: \"cluster.cluster.x-k8s.io\": prefer a domain-qualified finalizer name to avoid accidental conflicts with other finalizer writers" logger="KubeAPIWarningLogger"
      ERROR failed to fetch Cluster: failed to generate asset "Cluster": failed to create cluster: failed to create infrastructure manifest: NutanixCluster.infrastructure.cluster.x-k8s.io "sgao-nutanix-zonal-l96qg" is invalid: spec.failureDomains[0].subnets[1]: Duplicate value: map[string]interface {}{"type":"uuid"}
      INFO Shutting down local Cluster API controllers...
      INFO Stopped controller: Cluster API
      WARNING process cluster-api-provider-nutanix exited with error: signal: killed
      INFO Stopped controller: nutanix infrastructure provider
      INFO Shutting down local Cluster API control plane...
      INFO Local Cluster API system has completed operations
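
      The rejected value in the ERROR line contains only the "type" field, which suggests the subnets list is checked for uniqueness on "type" alone. That reading is an inference from the message, not something confirmed in this report. The sketch below illustrates the idea under that assumption; the helper name duplicate_indices and the entry shape {"type", "uuid"} are hypothetical and not taken from the installer or the Nutanix provider.

```python
from typing import Dict, List, Sequence


def duplicate_indices(items: List[Dict[str, str]], key_fields: Sequence[str]) -> List[int]:
    """Return the indices of entries whose key tuple has already been seen."""
    seen = set()
    dupes = []
    for index, item in enumerate(items):
        # Build the uniqueness key from the chosen fields only.
        key = tuple(item.get(field) for field in key_fields)
        if key in seen:
            dupes.append(index)
        else:
            seen.add(key)
    return dupes


# Assumed expansion of the two subnetUUIDs of the first failure domain
# (the entry shape is a guess for illustration).
subnets = [
    {"type": "uuid", "uuid": "512c1d6f-c6e7-4746-8ae2-9c3e1db2aba6"},
    {"type": "uuid", "uuid": "a94cb75c-24ff-4ee2-85cf-c2f906ee9fe5"},
]

# Keyed on "type" only: the second entry collides with the first.
by_type = duplicate_indices(subnets, ["type"])

# Keyed on "type" and "uuid": the two entries are distinct.
by_type_and_uuid = duplicate_indices(subnets, ["type", "uuid"])

print(by_type)
print(by_type_and_uuid)
```

      With a key of "type" only, the entry at index 1 is flagged, which lines up with the subnets[1] path in the error. Any failure domain with a single subnet would pass the same check, consistent with the other two failure domains not being reported.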

      Version-Release number of selected component (if applicable):

          4.18.0-0.nightly-2025-01-04-101226

      How reproducible:

          always

      Steps to Reproduce:

          1. Set multiple NICs (more than one entry in subnetUUIDs) for a failure domain under failureDomains, then install the cluster.
      
      e.g.
          failureDomains:
          ...
            subnetUUIDs:
            - 512c1d6f-c6e7-4746-8ae2-9c3e1db2aba6
            - a94cb75c-24ff-4ee2-85cf-c2f906ee9fe5
          - name: failure-domain-2
          ...
            subnetUUIDs:
            - d1b1b617-23de-4a9d-b53f-4b386fc27600
          - name: failure-domain-3
          ...
            subnetUUIDs:
            - 3624b067-61e2-4703-b8bf-3810de5cbac1
      

      Actual results:

          Installation fails with the "Duplicate value" validation error on spec.failureDomains[0].subnets[1].

      Expected results:

          Installation should succeed when multiple NICs are configured in failureDomains.

      Additional info:

      For the Slack discussion, see https://redhat-external.slack.com/archives/C0211848DBN/p1735790959497809

              yanhli@redhat.com Yanhua Li
              sgaoshang Shang Gao
              Shang Gao Shang Gao