-
Bug
-
Resolution: Unresolved
-
Undefined
-
None
-
4.20
-
None
-
False
-
-
None
-
Important
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
Description of problem:
During cluster's scale in BMH resources for respective nodes ain't removed but stuck in `deleting` state:
oc get bmh
NAME STATE CONSUMER ONLINE ERROR AGE
master-0 provisioned true 24h
master-1 provisioned true 24h
master-2 provisioned true 24h
worker-0 provisioned true 24h
worker-1 deleting true provisioning error 24h
worker-2 deleting true provisioning error 24h
worker-3 provisioned true 24h
oc describe bmh worker-1
...
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Registered 30m metal3-baremetal-controller Registered new host
Normal BMCAccessValidated 30m metal3-baremetal-controller Verified access to BMC
Normal BMCAccessValidated 30m metal3-baremetal-controller Verified access to BMC
Normal ProvisioningError 30m metal3-baremetal-controller Cleaning failed: Failed to prepare node 343876eb-df14-484e-9de9-09106811966a for cleaning: Validation of image href file:///templates/uefi_esp.img failed, reason: Specified image file not found.
Normal ProvisioningError 28m metal3-baremetal-controller Cleaning failed: Failed to prepare node 343876eb-df14-484e-9de9-09106811966a for cleaning: Validation of image href file:///templates/uefi_esp.img failed, reason: Specified image file not found.
Normal ProvisioningError 25m metal3-baremetal-controller Cleaning failed: Failed to prepare node 343876eb-df14-484e-9de9-09106811966a for cleaning: Validation of image href file:///templates/uefi_esp.img failed, reason: Specified image file not found.
Normal BMCAccessValidated 18m metal3-baremetal-controller Verified access to BMC
Normal BMCAccessValidated 18m metal3-baremetal-controller Verified access to BMC
Normal ProvisioningError 18m metal3-baremetal-controller Cleaning failed: Failed to prepare node 343876eb-df14-484e-9de9-09106811966a for cleaning: Validation of image href file:///templates/uefi_esp.img failed, reason: Specified image file not found.
Version-Release number of selected component (if applicable):
OCP 4.20.9 advanced-cluster-management.v2.15.0 multicluster-engine.v2.10.0
How reproducible:
So far reproduced on 2 setups
Steps to Reproduce:
1. Deploy barematal multinode cluster using GitOps ZTP deployment with `ClusterInstance` CR and siteconfig operator
2. Follow scale in procedure
3. Check that BMH resources are in `deleting` state
Actual results:
Nodes are removed from OpenShift cluster(`oc get nodes` doesn't show removed nodes). Nodes are powered off.
BMH resources stuck in `deleting` state
Expected results:
BMH resources are removed
Additional info: