Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-8728

Machine Pool scaling doesn't work for Openstack cluster

XMLWordPrintable

    • False
    • None
    • False
    • Yes
    • Critical

      Description of problem:

      After provisioning an openstack cluster on 4.13.11 and 4.14.0-rc.0, we can see that the machine pools are at 0, and any scaling action does not do anything:

      Scaling up to 4 does not initiate any provisioning on the cluster:

      Version-Release number of selected component (if applicable):

      ACM 2.9.0-DOWNSTREAM-2023-09-11-15-47-23

      How reproducible:

      Steps to Reproduce:

      1. deploy openstack cluster on 4.14.0-rc.0
      2. check machine pools for cluster
      3. ...

      Actual results:

      Expected results:

      Additional info:

      CD (hub):

       apiVersion: hive.openshift.io/v1
      kind: ClusterDeployment
      metadata:
        annotations:
          open-cluster-management.io/user-group: c3lzdGVtOmNsdXN0ZXItYWRtaW5zLHN5c3RlbTphdXRoZW50aWNhdGVk
          open-cluster-management.io/user-identity: a3ViZTphZG1pbg==
        creationTimestamp: "2023-09-12T15:41:47Z"
        finalizers:
        - hive.openshift.io/deprovision
        generation: 3
        labels:
          cloud: OpenStack
          cluster.open-cluster-management.io/clusterset: default
          hive.openshift.io/cluster-platform: openstack
          hive.openshift.io/cluster-region: unknown
          hive.openshift.io/version: 4.14.0-rc.0
          hive.openshift.io/version-major: "4"
          hive.openshift.io/version-major-minor: "4.14"
          hive.openshift.io/version-major-minor-patch: 4.14.0
          vendor: OpenShift
        name: clc-auto-psi
        namespace: clc-auto-psi
        resourceVersion: "803680"
        uid: af6af255-ff7c-42d1-812d-7717eb9b40d5
      spec:
        baseDomain: dev09.red-chesterfield.com
        clusterMetadata:
          adminKubeconfigSecretRef:
            name: clc-auto-psi-0-nr48s-admin-kubeconfig
          adminPasswordSecretRef:
            name: clc-auto-psi-0-nr48s-admin-password
          clusterID: be7d71b6-f1f1-4d60-a20c-fefb7f45e157
          infraID: clc-auto-psi-cz4qp
        clusterName: clc-auto-psi
        controlPlaneConfig:
          servingCertificates: {}
        installAttemptsLimit: 1
        installed: true
        platform:
          openstack:
            certificatesSecretRef:
              name: clc-auto-psi-openstack-trust
            cloud: openstack
            credentialsSecretRef:
              name: clc-auto-psi-openstack-creds
        provisioning:
          imageSetRef:
            name: img4.14.0-rc.0-multi
          installConfigSecretRef:
            name: clc-auto-psi-install-config
          sshPrivateKeySecretRef:
            name: clc-auto-psi-ssh-private-key
        pullSecretRef:
          name: clc-auto-psi-pull-secret
      status:
        apiURL: https://api.clc-auto-psi.dev09.red-chesterfield.com:6443
        cliImage: quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:c6fde16873a3def595063f2ae2a7ea786207d548fae3f4a174aab181cfd8207c
        conditions:
        - lastProbeTime: "2023-09-12T16:21:43Z"
          lastTransitionTime: "2023-09-12T16:21:43Z"
          message: 'Unsupported platform: no actuator to handle it'
          reason: Unsupported
          status: "False"
          type: Hibernating
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Platform credentials passed authentication check
          reason: PlatformAuthSuccess
          status: "False"
          type: AuthenticationFailure
        - lastProbeTime: "2023-09-12T16:21:43Z"
          lastTransitionTime: "2023-09-12T16:21:43Z"
          message: Control plane certificates are present
          reason: ControlPlaneCertificatesFound
          status: "False"
          type: ControlPlaneCertificateNotFound
        - lastProbeTime: "2023-09-12T15:42:00Z"
          lastTransitionTime: "2023-09-12T15:42:00Z"
          message: Images required for cluster deployment installations are resolved
          reason: ImagesResolved
          status: "False"
          type: InstallImagesNotResolved
        - lastProbeTime: "2023-09-12T15:42:24Z"
          lastTransitionTime: "2023-09-12T15:42:24Z"
          message: Successfully launched install pod
          reason: InstallLaunchSuccessful
          status: "False"
          type: InstallLaunchError
        - lastProbeTime: "2023-09-12T15:41:56Z"
          lastTransitionTime: "2023-09-12T15:41:56Z"
          message: InstallerImage is resolved.
          reason: InstallerImageResolved
          status: "False"
          type: InstallerImageResolutionFailed
        - lastProbeTime: "2023-09-12T16:21:43Z"
          lastTransitionTime: "2023-09-12T16:21:43Z"
          message: Provision clc-auto-psi-0-nr48s succeeded.
          reason: ProvisionSucceeded
          status: "False"
          type: ProvisionFailed
        - lastProbeTime: "2023-09-12T15:42:00Z"
          lastTransitionTime: "2023-09-12T15:42:00Z"
          message: Provision is not stopped
          reason: ProvisionNotStopped
          status: "False"
          type: ProvisionStopped
        - lastProbeTime: "2023-09-12T16:21:43Z"
          lastTransitionTime: "2023-09-12T16:21:43Z"
          message: Cluster is provisioned
          reason: Provisioned
          status: "True"
          type: Provisioned
        - lastProbeTime: "2023-09-12T16:21:43Z"
          lastTransitionTime: "2023-09-12T16:21:43Z"
          message: No power state actuator -- assuming running
          reason: Running
          status: "True"
          type: Ready
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: no ClusterRelocates match
          reason: NoMatchingRelocates
          status: "False"
          type: RelocationFailed
        - lastProbeTime: "2023-09-12T15:42:00Z"
          lastTransitionTime: "2023-09-12T15:42:00Z"
          message: All pre-provision requirements met
          reason: AllRequirementsMet
          status: "True"
          type: RequirementsMet
        - lastProbeTime: "2023-09-12T16:21:45Z"
          lastTransitionTime: "2023-09-12T16:21:45Z"
          message: SyncSet apply is successful
          reason: SyncSetApplySuccess
          status: "False"
          type: SyncSetFailed
        - lastProbeTime: "2023-09-12T18:21:43Z"
          lastTransitionTime: "2023-09-12T16:21:43Z"
          message: cluster is reachable
          reason: ClusterReachable
          status: "False"
          type: Unreachable
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: AWSPrivateLinkFailed
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: AWSPrivateLinkReady
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: ActiveAPIURLOverride
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: ClusterInstallCompleted
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: ClusterInstallFailed
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: ClusterInstallRequirementsMet
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: ClusterInstallStopped
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: DNSNotReady
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: DeprovisionLaunchError
        - lastProbeTime: "2023-09-12T15:41:48Z"
          lastTransitionTime: "2023-09-12T15:41:48Z"
          message: Condition Initialized
          reason: Initialized
          status: Unknown
          type: IngressCertificateNotFound
        installStartedTimestamp: "2023-09-12T15:42:00Z"
        installVersion: 4.14.0-rc.0
        installedTimestamp: "2023-09-12T16:21:43Z"
        installerImage: quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:e4aa8f7b1caf1a4674d463e5d96987711cd77d9e83f33b912b02441b2cc15d13
        powerState: Running
        provisionRef:
          name: clc-auto-psi-0-nr48s
        webConsoleURL: https://console-openshift-console.apps.clc-auto-psi.dev09.red-chesterfield.com

      MachinePool (hub):

       apiVersion: v1
      items:
      - apiVersion: hive.openshift.io/v1
        kind: MachinePool
        metadata:
          creationTimestamp: "2023-09-12T15:41:47Z"
          finalizers:
          - hive.openshift.io/remotemachineset
          generation: 2
          name: clc-auto-psi-worker
          namespace: clc-auto-psi
          resourceVersion: "697731"
          uid: c3f267be-1996-41ec-b3c8-8eaffd130215
        spec:
          clusterDeploymentRef:
            name: clc-auto-psi
          name: worker
          platform:
            openstack:
              flavor: ocp-master-large
          replicas: 4
        status:
          conditions:
          - lastProbeTime: "2023-09-12T15:41:47Z"
            lastTransitionTime: "2023-09-12T15:41:47Z"
            message: Condition Initialized
            reason: Initialized
            status: Unknown
            type: NotEnoughReplicas
          - lastProbeTime: "2023-09-12T15:41:47Z"
            lastTransitionTime: "2023-09-12T15:41:47Z"
            message: Condition Initialized
            reason: Initialized
            status: Unknown
            type: NoMachinePoolNameLeasesAvailable
          - lastProbeTime: "2023-09-12T15:41:47Z"
            lastTransitionTime: "2023-09-12T15:41:47Z"
            message: Condition Initialized
            reason: Initialized
            status: Unknown
            type: InvalidSubnets
          - lastProbeTime: "2023-09-12T15:41:47Z"
            lastTransitionTime: "2023-09-12T15:41:47Z"
            message: Condition Initialized
            reason: Initialized
            status: Unknown
            type: UnsupportedConfiguration
      kind: List
      metadata:
        resourceVersion: ""

      MachineSet (cluster):

      NAME                          DESIRED   CURRENT   READY   AVAILABLE   AGE
      clc-auto-psi-cz4qp-worker-0   3         3         3       3           179m
      
      apiVersion: v1
      items:
      - apiVersion: machine.openshift.io/v1beta1
        kind: MachineSet
        metadata:
          annotations:
            machine.openshift.io/memoryMb: "16384"
            machine.openshift.io/vCPU: "16"
          creationTimestamp: "2023-09-12T15:52:02Z"
          generation: 1
          labels:
            machine.openshift.io/cluster-api-cluster: clc-auto-psi-cz4qp
            machine.openshift.io/cluster-api-machine-role: worker
            machine.openshift.io/cluster-api-machine-type: worker
          name: clc-auto-psi-cz4qp-worker-0
          namespace: openshift-machine-api
          resourceVersion: "31044"
          uid: 79555ee5-a675-4007-81c8-c70452c192a5
        spec:
          replicas: 3
          selector:
            matchLabels:
              machine.openshift.io/cluster-api-cluster: clc-auto-psi-cz4qp
              machine.openshift.io/cluster-api-machineset: clc-auto-psi-cz4qp-worker-0
          template:
            metadata:
              labels:
                machine.openshift.io/cluster-api-cluster: clc-auto-psi-cz4qp
                machine.openshift.io/cluster-api-machine-role: worker
                machine.openshift.io/cluster-api-machine-type: worker
                machine.openshift.io/cluster-api-machineset: clc-auto-psi-cz4qp-worker-0
            spec:
              lifecycleHooks: {}
              metadata: {}
              providerSpec:
                value:
                  apiVersion: machine.openshift.io/v1alpha1
                  cloudName: openstack
                  cloudsSecret:
                    name: openstack-cloud-credentials
                    namespace: openshift-machine-api
                  flavor: ocp-master-large
                  image: clc-auto-psi-cz4qp-rhcos
                  kind: OpenstackProviderSpec
                  metadata:
                    creationTimestamp: null
                  networks:
                  - filter: {}
                    subnets:
                    - filter:
                        name: clc-auto-psi-cz4qp-nodes
                        tags: openshiftClusterID=clc-auto-psi-cz4qp
                  securityGroups:
                  - filter: {}
                    name: clc-auto-psi-cz4qp-worker
                  serverGroupName: clc-auto-psi-cz4qp-worker
                  serverMetadata:
                    Name: clc-auto-psi-cz4qp-worker
                    openshiftClusterID: clc-auto-psi-cz4qp
                  tags:
                  - openshiftClusterID=clc-auto-psi-cz4qp
                  trunk: true
                  userDataSecret:
                    name: worker-user-data
        status:
          availableReplicas: 3
          fullyLabeledReplicas: 3
          observedGeneration: 1
          readyReplicas: 3
          replicas: 3
      kind: List
      metadata:
        resourceVersion: ""

      Machine (cluster):

       oc get machines.machine.openshift.io -n openshift-machine-api
      NAME                                PHASE     TYPE               REGION      ZONE   AGE
      clc-auto-psi-cz4qp-master-0         Running   ocp-master-large   regionOne   nova   3h
      clc-auto-psi-cz4qp-master-1         Running   ocp-master-large   regionOne   nova   3h
      clc-auto-psi-cz4qp-master-2         Running   ocp-master-large   regionOne   nova   3h
      clc-auto-psi-cz4qp-worker-0-8hl6f   Running   ocp-master-large   regionOne   nova   171m
      clc-auto-psi-cz4qp-worker-0-g48lm   Running   ocp-master-large   regionOne   nova   171m
      clc-auto-psi-cz4qp-worker-0-rbfz9   Running   ocp-master-large   regionOne   nova   171m

            magchen@redhat.com Maggie Chen
            rhn-support-dhuynh David Huynh
            Atif Shafi Atif Shafi
            ACM QE Team
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: