-
Bug
-
Resolution: Duplicate
-
Critical
-
None
-
4.16
-
Quality / Stability / Reliability
-
False
-
-
None
-
Critical
-
None
-
None
-
None
-
Proposed
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
Description of problem:
machine-api CO fails to start on the baremetal cluster:
oc get co machine-api
NAME VERSION AVAILABLE PROGRESSING DEGRADED SINCE MESSAGE
machine-api False True True 135m Operator is initializing
apiVersion: config.openshift.io/v1
kind: ClusterOperator
metadata:
annotations:
capability.openshift.io/name: MachineAPI
exclude.release.openshift.io/internal-openshift-hosted: "true"
include.release.openshift.io/self-managed-high-availability: "true"
include.release.openshift.io/single-node-developer: "true"
creationTimestamp: "2024-05-21T10:42:54Z"
generation: 1
name: machine-api
ownerReferences:
- apiVersion: config.openshift.io/v1
controller: true
kind: ClusterVersion
name: version
uid: 11e027a9-e1d7-476b-ab55-ce028a51b006
resourceVersion: "12511"
uid: 8671f8e4-f379-44b0-aa43-e3a76895525c
spec: {}
status:
conditions:
- lastTransitionTime: "2024-05-21T10:50:42Z"
message: 'Progressing towards operator: 4.16.0-rc.2'
reason: SyncingResources
status: "True"
type: Progressing
- lastTransitionTime: "2024-05-21T10:53:49Z"
message: 'Failed when progressing towards operator: 4.16.0-rc.2 because minimum
worker replica count (2) not yet met: current running replicas 0, waiting for
[ecore-vkxlb-worker-0-2bcfv ecore-vkxlb-worker-0-8fh2g ecore-vkxlb-worker-0-frjcw
ecore-vkxlb-worker-0-lnqrh]'
reason: SyncingFailed
status: "True"
type: Degraded
- lastTransitionTime: "2024-05-21T10:50:42Z"
message: Operator is initializing
reason: Initializing
status: "False"
type: Available
- lastTransitionTime: "2024-05-21T10:50:42Z"
status: "True"
type: Upgradeable
extension: null
relatedObjects:
- group: ""
name: openshift-machine-api
resource: namespaces
- group: machine.openshift.io
name: ""
namespace: openshift-machine-api
resource: machines
- group: machine.openshift.io
name: ""
namespace: openshift-machine-api
resource: machinesets
- group: machine.openshift.io
name: ""
namespace: openshift-machine-api
resource: machinehealthchecks
- group: rbac.authorization.k8s.io
name: ""
namespace: openshift-machine-api
resource: roles
- group: rbac.authorization.k8s.io
name: machine-api-operator
resource: clusterroles
- group: rbac.authorization.k8s.io
name: machine-api-controllers
resource: clusterroles
- group: metal3.io
name: ""
namespace: openshift-machine-api
resource: baremetalhosts
Version-Release number of selected component (if applicable):
4.16.0-rc.2
How reproducible:
so far once
Steps to Reproduce:
1. Install baremetal SNO HUB cluster
2. Install ACM/MCE version 2.11
3. Install TALM, GitOps operators
4. Start baremetal cluster deployment with GitOps ZTP workflow
Actual results:
Deployment is stuck:
oc get clusterdeployments.hive.openshift.io -n ecore
NAME INFRAID PLATFORM REGION VERSION CLUSTERTYPE PROVISIONSTATUS POWERSTATE AGE
ecore agent-baremetal Provisioning 164m
Expected results:
machine-api operator is started on the spoke cluster
Additional info:
Dualstack baremetal setup