-
Bug
-
Resolution: Not a Bug
-
Major
-
None
-
4.17.z
-
None
-
Quality / Stability / Reliability
-
False
-
-
None
-
Important
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
Description of problem:
OCP cluster install fails on Nutanix flow env, nodes tainted with node.cloudprovider.kubernetes.io/uninitialized=true:NoSchedule
Version-Release number of selected component (if applicable):
4.17.0-0.nightly-2024-10-22-215145
How reproducible:
Always
Steps to Reproduce:
1. Install OCP on Nutanix flow env failed with error
# oc get clusterversion
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version False True 4m29s Unable to apply 4.17.0-0.nightly-2024-10-22-215145: an unknown error has occurred: MultipleErrors
# oc get nodes
NAME STATUS ROLES AGE VERSION
sgao-flow-n9xdl-master-0 NotReady control-plane,master 2m22s v1.30.5
sgao-flow-n9xdl-master-1 NotReady control-plane,master 2m28s v1.30.5
sgao-flow-n9xdl-master-2 NotReady control-plane,master 2m23s v1.30.5
# oc describe node sgao-flow-n9xdl-master-0
Name: sgao-flow-n9xdl-master-0
Roles: control-plane,master
Labels: beta.kubernetes.io/arch=amd64
beta.kubernetes.io/os=linux
kubernetes.io/arch=amd64
kubernetes.io/hostname=sgao-flow-n9xdl-master-0
kubernetes.io/os=linux
node-role.kubernetes.io/control-plane=
node-role.kubernetes.io/master=
node.openshift.io/os_id=rhcos
Annotations: alpha.kubernetes.io/provided-node-ip: 10.10.1.37
volumes.kubernetes.io/controller-managed-attach-detach: true
CreationTimestamp: Wed, 23 Oct 2024 02:48:16 -0400
Taints: node-role.kubernetes.io/master:NoSchedule
node.cloudprovider.kubernetes.io/uninitialized=true:NoSchedule
node.kubernetes.io/not-ready:NoSchedule
Unschedulable: false
Lease:
HolderIdentity: sgao-flow-n9xdl-master-0
AcquireTime: <unset>
RenewTime: Wed, 23 Oct 2024 02:53:31 -0400
Conditions:
Type Status LastHeartbeatTime LastTransitionTime Reason Message
---- ------ ----------------- ------------------ ------ -------
MemoryPressure False Wed, 23 Oct 2024 02:48:57 -0400 Wed, 23 Oct 2024 02:48:16 -0400 KubeletHasSufficientMemory kubelet has sufficient memory available
DiskPressure False Wed, 23 Oct 2024 02:48:57 -0400 Wed, 23 Oct 2024 02:48:16 -0400 KubeletHasNoDiskPressure kubelet has no disk pressure
PIDPressure False Wed, 23 Oct 2024 02:48:57 -0400 Wed, 23 Oct 2024 02:48:16 -0400 KubeletHasSufficientPID kubelet has sufficient PID available
Ready False Wed, 23 Oct 2024 02:48:57 -0400 Wed, 23 Oct 2024 02:48:16 -0400 KubeletNotReady container runtime network not ready: NetworkReady=false reason:NetworkPluginNotReady message:Network plugin returns error: no CNI configuration file in /etc/kubernetes/cni/net.d/. Has your network provider started?
Addresses:
InternalIP: 10.10.1.37
Hostname: sgao-flow-n9xdl-master-0
Capacity:
cpu: 8
ephemeral-storage: 125238252Ki
hugepages-1Gi: 0
hugepages-2Mi: 0
memory: 16306408Ki
pods: 250
Allocatable:
cpu: 7500m
ephemeral-storage: 114345831029
hugepages-1Gi: 0
hugepages-2Mi: 0
memory: 15155432Ki
pods: 250
System Info:
Machine ID: 07aef2cad3a74b51bd5f9123d6b8d7c0
System UUID: 07aef2ca-d3a7-4b51-bd5f-9123d6b8d7c0
Boot ID: bf73ad9c-4f58-4a3e-96c3-a745c8b337af
Kernel Version: 5.14.0-427.42.1.el9_4.x86_64
OS Image: Red Hat Enterprise Linux CoreOS 417.94.202410211619-0
Operating System: linux
Architecture: amd64
Container Runtime Version: cri-o://1.30.6-6.rhaos4.17.git6ac6e96.el9
Kubelet Version: v1.30.5
Kube-Proxy Version: v1.30.5
Non-terminated Pods: (5 in total)
Namespace Name CPU Requests CPU Limits Memory Requests Memory Limits Age
--------- ---- ------------ ---------- --------------- ------------- ---
openshift-cloud-controller-manager-operator cluster-cloud-controller-manager-operator-6669679c46-xh55m 30m (0%) 0 (0%) 95Mi (0%) 0 (0%) 4m36s
openshift-machine-config-operator kube-rbac-proxy-crio-sgao-flow-n9xdl-master-0 20m (0%) 0 (0%) 50Mi (0%) 0 (0%) 5m
openshift-nutanix-infra coredns-sgao-flow-n9xdl-master-0 200m (2%) 0 (0%) 400Mi (2%) 0 (0%) 4m34s
openshift-nutanix-infra haproxy-sgao-flow-n9xdl-master-0 200m (2%) 0 (0%) 400Mi (2%) 0 (0%) 3m56s
openshift-nutanix-infra keepalived-sgao-flow-n9xdl-master-0 200m (2%) 0 (0%) 400Mi (2%) 0 (0%) 4m33s
Allocated resources:
(Total limits may be over 100 percent, i.e., overcommitted.)
Resource Requests Limits
-------- -------- ------
cpu 650m (8%) 0 (0%)
memory 1345Mi (9%) 0 (0%)
ephemeral-storage 0 (0%) 0 (0%)
hugepages-1Gi 0 (0%) 0 (0%)
hugepages-2Mi 0 (0%) 0 (0%)
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal RegisteredNode 5m16s node-controller Node sgao-flow-n9xdl-master-0 event: Registered Node sgao-flow-n9xdl-master-0 in Controller
Actual results:
Install failed
Expected results:
Install success
Additional info:
Thread discussion pls see https://redhat-external.slack.com/archives/C0211848DBN/p1728357973803949?thread_ts=1727883651.537649&cid=C0211848DBN