-
Bug
-
Resolution: Done-Errata
-
Major
-
ACM 2.9.0, MCE 2.4.0
-
False
-
None
-
False
-
-
-
Critical
-
Yes
Description of problem:
After provisioning an openstack cluster on 4.13.11 and 4.14.0-rc.0, we can see that the machine pools are at 0, and any scaling action does not do anything:
Scaling up to 4 does not initiate any provisioning on the cluster:
Version-Release number of selected component (if applicable):
ACM 2.9.0-DOWNSTREAM-2023-09-11-15-47-23
How reproducible:
Steps to Reproduce:
- deploy openstack cluster on 4.14.0-rc.0
- check machine pools for cluster
- ...
Actual results:
Expected results:
Additional info:
CD (hub):
apiVersion: hive.openshift.io/v1 kind: ClusterDeployment metadata:  annotations:   open-cluster-management.io/user-group: c3lzdGVtOmNsdXN0ZXItYWRtaW5zLHN5c3RlbTphdXRoZW50aWNhdGVk   open-cluster-management.io/user-identity: a3ViZTphZG1pbg==  creationTimestamp: "2023-09-12T15:41:47Z"  finalizers:  - hive.openshift.io/deprovision  generation: 3  labels:   cloud: OpenStack   cluster.open-cluster-management.io/clusterset: default   hive.openshift.io/cluster-platform: openstack   hive.openshift.io/cluster-region: unknown   hive.openshift.io/version: 4.14.0-rc.0   hive.openshift.io/version-major: "4"   hive.openshift.io/version-major-minor: "4.14"   hive.openshift.io/version-major-minor-patch: 4.14.0   vendor: OpenShift  name: clc-auto-psi  namespace: clc-auto-psi  resourceVersion: "803680"  uid: af6af255-ff7c-42d1-812d-7717eb9b40d5 spec:  baseDomain: dev09.red-chesterfield.com  clusterMetadata:   adminKubeconfigSecretRef:    name: clc-auto-psi-0-nr48s-admin-kubeconfig   adminPasswordSecretRef:    name: clc-auto-psi-0-nr48s-admin-password   clusterID: be7d71b6-f1f1-4d60-a20c-fefb7f45e157   infraID: clc-auto-psi-cz4qp  clusterName: clc-auto-psi  controlPlaneConfig:   servingCertificates: {}  installAttemptsLimit: 1  installed: true  platform:   openstack:    certificatesSecretRef:     name: clc-auto-psi-openstack-trust    cloud: openstack    credentialsSecretRef:     name: clc-auto-psi-openstack-creds  provisioning:   imageSetRef:    name: img4.14.0-rc.0-multi   installConfigSecretRef:    name: clc-auto-psi-install-config   sshPrivateKeySecretRef:    name: clc-auto-psi-ssh-private-key  pullSecretRef:   name: clc-auto-psi-pull-secret status:  apiURL: https://api.clc-auto-psi.dev09.red-chesterfield.com:6443  cliImage: quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:c6fde16873a3def595063f2ae2a7ea786207d548fae3f4a174aab181cfd8207c  conditions:  - lastProbeTime: "2023-09-12T16:21:43Z"   lastTransitionTime: "2023-09-12T16:21:43Z"   message: 'Unsupported platform: no actuator to handle it'   reason: Unsupported   status: "False"   type: Hibernating  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Platform credentials passed authentication check   reason: PlatformAuthSuccess   status: "False"   type: AuthenticationFailure  - lastProbeTime: "2023-09-12T16:21:43Z"   lastTransitionTime: "2023-09-12T16:21:43Z"   message: Control plane certificates are present   reason: ControlPlaneCertificatesFound   status: "False"   type: ControlPlaneCertificateNotFound  - lastProbeTime: "2023-09-12T15:42:00Z"   lastTransitionTime: "2023-09-12T15:42:00Z"   message: Images required for cluster deployment installations are resolved   reason: ImagesResolved   status: "False"   type: InstallImagesNotResolved  - lastProbeTime: "2023-09-12T15:42:24Z"   lastTransitionTime: "2023-09-12T15:42:24Z"   message: Successfully launched install pod   reason: InstallLaunchSuccessful   status: "False"   type: InstallLaunchError  - lastProbeTime: "2023-09-12T15:41:56Z"   lastTransitionTime: "2023-09-12T15:41:56Z"   message: InstallerImage is resolved.   reason: InstallerImageResolved   status: "False"   type: InstallerImageResolutionFailed  - lastProbeTime: "2023-09-12T16:21:43Z"   lastTransitionTime: "2023-09-12T16:21:43Z"   message: Provision clc-auto-psi-0-nr48s succeeded.   reason: ProvisionSucceeded   status: "False"   type: ProvisionFailed  - lastProbeTime: "2023-09-12T15:42:00Z"   lastTransitionTime: "2023-09-12T15:42:00Z"   message: Provision is not stopped   reason: ProvisionNotStopped   status: "False"   type: ProvisionStopped  - lastProbeTime: "2023-09-12T16:21:43Z"   lastTransitionTime: "2023-09-12T16:21:43Z"   message: Cluster is provisioned   reason: Provisioned   status: "True"   type: Provisioned  - lastProbeTime: "2023-09-12T16:21:43Z"   lastTransitionTime: "2023-09-12T16:21:43Z"   message: No power state actuator -- assuming running   reason: Running   status: "True"   type: Ready  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: no ClusterRelocates match   reason: NoMatchingRelocates   status: "False"   type: RelocationFailed  - lastProbeTime: "2023-09-12T15:42:00Z"   lastTransitionTime: "2023-09-12T15:42:00Z"   message: All pre-provision requirements met   reason: AllRequirementsMet   status: "True"   type: RequirementsMet  - lastProbeTime: "2023-09-12T16:21:45Z"   lastTransitionTime: "2023-09-12T16:21:45Z"   message: SyncSet apply is successful   reason: SyncSetApplySuccess   status: "False"   type: SyncSetFailed  - lastProbeTime: "2023-09-12T18:21:43Z"   lastTransitionTime: "2023-09-12T16:21:43Z"   message: cluster is reachable   reason: ClusterReachable   status: "False"   type: Unreachable  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: AWSPrivateLinkFailed  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: AWSPrivateLinkReady  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: ActiveAPIURLOverride  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: ClusterInstallCompleted  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: ClusterInstallFailed  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: ClusterInstallRequirementsMet  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: ClusterInstallStopped  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: DNSNotReady  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: DeprovisionLaunchError  - lastProbeTime: "2023-09-12T15:41:48Z"   lastTransitionTime: "2023-09-12T15:41:48Z"   message: Condition Initialized   reason: Initialized   status: Unknown   type: IngressCertificateNotFound  installStartedTimestamp: "2023-09-12T15:42:00Z"  installVersion: 4.14.0-rc.0  installedTimestamp: "2023-09-12T16:21:43Z"  installerImage: quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:e4aa8f7b1caf1a4674d463e5d96987711cd77d9e83f33b912b02441b2cc15d13  powerState: Running  provisionRef:   name: clc-auto-psi-0-nr48s  webConsoleURL: https://console-openshift-console.apps.clc-auto-psi.dev09.red-chesterfield.com
MachinePool (hub):
apiVersion: v1 items: - apiVersion: hive.openshift.io/v1  kind: MachinePool  metadata:   creationTimestamp: "2023-09-12T15:41:47Z"   finalizers:   - hive.openshift.io/remotemachineset   generation: 2   name: clc-auto-psi-worker   namespace: clc-auto-psi   resourceVersion: "697731"   uid: c3f267be-1996-41ec-b3c8-8eaffd130215  spec:   clusterDeploymentRef:    name: clc-auto-psi   name: worker   platform:    openstack:     flavor: ocp-master-large   replicas: 4  status:   conditions:   - lastProbeTime: "2023-09-12T15:41:47Z"    lastTransitionTime: "2023-09-12T15:41:47Z"    message: Condition Initialized    reason: Initialized    status: Unknown    type: NotEnoughReplicas   - lastProbeTime: "2023-09-12T15:41:47Z"    lastTransitionTime: "2023-09-12T15:41:47Z"    message: Condition Initialized    reason: Initialized    status: Unknown    type: NoMachinePoolNameLeasesAvailable   - lastProbeTime: "2023-09-12T15:41:47Z"    lastTransitionTime: "2023-09-12T15:41:47Z"    message: Condition Initialized    reason: Initialized    status: Unknown    type: InvalidSubnets   - lastProbeTime: "2023-09-12T15:41:47Z"    lastTransitionTime: "2023-09-12T15:41:47Z"    message: Condition Initialized    reason: Initialized    status: Unknown    type: UnsupportedConfiguration kind: List metadata:  resourceVersion: ""
MachineSet (cluster):
NAME              DESIRED  CURRENT  READY  AVAILABLE  AGE clc-auto-psi-cz4qp-worker-0  3     3     3    3      179m apiVersion: v1 items: - apiVersion: machine.openshift.io/v1beta1  kind: MachineSet  metadata:   annotations:    machine.openshift.io/memoryMb: "16384"    machine.openshift.io/vCPU: "16"   creationTimestamp: "2023-09-12T15:52:02Z"   generation: 1   labels:    machine.openshift.io/cluster-api-cluster: clc-auto-psi-cz4qp    machine.openshift.io/cluster-api-machine-role: worker    machine.openshift.io/cluster-api-machine-type: worker   name: clc-auto-psi-cz4qp-worker-0   namespace: openshift-machine-api   resourceVersion: "31044"   uid: 79555ee5-a675-4007-81c8-c70452c192a5  spec:   replicas: 3   selector:    matchLabels:     machine.openshift.io/cluster-api-cluster: clc-auto-psi-cz4qp     machine.openshift.io/cluster-api-machineset: clc-auto-psi-cz4qp-worker-0   template:    metadata:     labels:      machine.openshift.io/cluster-api-cluster: clc-auto-psi-cz4qp      machine.openshift.io/cluster-api-machine-role: worker      machine.openshift.io/cluster-api-machine-type: worker      machine.openshift.io/cluster-api-machineset: clc-auto-psi-cz4qp-worker-0    spec:     lifecycleHooks: {}     metadata: {}     providerSpec:      value:       apiVersion: machine.openshift.io/v1alpha1       cloudName: openstack       cloudsSecret:        name: openstack-cloud-credentials        namespace: openshift-machine-api       flavor: ocp-master-large       image: clc-auto-psi-cz4qp-rhcos       kind: OpenstackProviderSpec       metadata:        creationTimestamp: null       networks:       - filter: {}        subnets:        - filter:          name: clc-auto-psi-cz4qp-nodes          tags: openshiftClusterID=clc-auto-psi-cz4qp       securityGroups:       - filter: {}        name: clc-auto-psi-cz4qp-worker       serverGroupName: clc-auto-psi-cz4qp-worker       serverMetadata:        Name: clc-auto-psi-cz4qp-worker        openshiftClusterID: clc-auto-psi-cz4qp       tags:       - openshiftClusterID=clc-auto-psi-cz4qp       trunk: true       userDataSecret:        name: worker-user-data  status:   availableReplicas: 3   fullyLabeledReplicas: 3   observedGeneration: 1   readyReplicas: 3   replicas: 3 kind: List metadata:  resourceVersion: ""
Machine (cluster):
oc get machines.machine.openshift.io -n openshift-machine-api NAME                 PHASE   TYPE        REGION    ZONE  AGE clc-auto-psi-cz4qp-master-0     Running  ocp-master-large  regionOne  nova  3h clc-auto-psi-cz4qp-master-1     Running  ocp-master-large  regionOne  nova  3h clc-auto-psi-cz4qp-master-2     Running  ocp-master-large  regionOne  nova  3h clc-auto-psi-cz4qp-worker-0-8hl6f  Running  ocp-master-large  regionOne  nova  171m clc-auto-psi-cz4qp-worker-0-g48lm  Running  ocp-master-large  regionOne  nova  171m clc-auto-psi-cz4qp-worker-0-rbfz9  Running  ocp-master-large  regionOne  nova  171m
- clones
-
HIVE-2360 Machine Pool scaling doesn't work for Openstack cluster
- Closed
- is caused by
-
ACM-6818 Updating the Version of Hive in each MCE releases
- Closed
- is cloned by
-
ACM-10633 Machine Pool scaling doesn't work for Openstack cluster
- New
- is duplicated by
-
ACM-9162 machinepools are not manually scalable through ACM in 2.9
- Closed
- is related to
-
MCO-778 ocp 4.14 vSphere cluster node stuck in Provisioned state
- Closed
-
ACM-8726 Remove cert reference from the ClusterDeployment file in Openstack
- Closed
- links to
-
RHSA-2024:126795 Red Hat Advanced Cluster Management 2.9.3 security and bug fix container updates