-
Bug
-
Resolution: Duplicate
-
Critical
-
None
-
4.16
-
None
-
Yes
-
Proposed
-
False
-
Description of problem:
When Performed Node stop and start operation on ibmcloud deployment cluster becomes inaccessible $ oc get nodes E0514 13:27:38.560306 511051 memcache.go:265] couldn't get current server API group list: Get "https://api.prsurve-ibmojf.ibmcloud2.qe.rh-ocs.com:6443/api?timeout=32s": EOF E0514 13:27:50.530433 511051 memcache.go:265] couldn't get current server API group list: Get "https://api.prsurve-ibmojf.ibmcloud2.qe.rh-ocs.com:6443/api?timeout=32s": EOF E0514 13:28:02.517502 511051 memcache.go:265] couldn't get current server API group list: Get "https://api.prsurve-ibmojf.ibmcloud2.qe.rh-ocs.com:6443/api?timeout=32s": EOF
Version-Release number of selected component (if applicable):
4.16.0-0.nightly-2024-05-08-222442
How reproducible:
- Always
Steps to Reproduce:
1. Deploy an IBM Cloud IPI cluster 2. stop all nodes using ibmcloud is instance-stop <node-name> --force=true 3. after few min start the nodes again ibmcloud is instance-start <node-name>
Actual results:
$ oc get nodes E0514 13:27:38.560306 511051 memcache.go:265] couldn't get current server API group list: Get "https://api.prsurve-ibmojf.ibmcloud2.qe.rh-ocs.com:6443/api?timeout=32s": EOF E0514 13:27:50.530433 511051 memcache.go:265] couldn't get current server API group list: Get "https://api.prsurve-ibmojf.ibmcloud2.qe.rh-ocs.com:6443/api?timeout=32s": EOF E0514 13:28:02.517502 511051 memcache.go:265] couldn't get current server API group list: Get "https://api.prsurve-ibmojf.ibmcloud2.qe.rh-ocs.com:6443/api?timeout=32s": EOF when check the vnc console, vm are operational.
Expected results:
Cluster should recover and it should be operational
Additional info:
Command output: https://url.corp.redhat.com/56e6800
- is duplicated by
-
OCPBUGS-33562 Cluster API is not accessible after all nodes are stopped and restarted during chaos testing
- Closed