Description of problem:
Altinfra build jobs are failing
Version-Release number of selected component (if applicable):
4.16
How reproducible:
Always
Steps to Reproduce:
1.Build master installer and use latest nightly 4.16 release image 2.Run CAPI enabled installer with FeatureSet CustomNoUpgrade and featureGates: ["ClusterAPIInstall=true"]
Actual results:
Cluster fails to complete boostrap
Expected results:
Cluster is able to install completely
Additional info:
This bug is to track investigation into why altinfra e2e jobs were failing for: https://prow.ci.openshift.org/job-history/gs/test-platform-results/pr-logs/directory/pull-ci-openshift-installer-master-altinfra-e2e-vsphere-capi-ovn Upon looking into it, etcd operator was not being created. We saw the following:
CVO:
402 17:18:59.959209 1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1" E0402 17:19:03.862993 1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1" E0402 17:19:09.157126 1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1" I0402 17:19:20.234944 1 task_graph.go:550] Result of work: [Could not update etcd "cluster" (108 of 937): the server does not recognize this resource, check extension API servers Cluster operator kube-apiserver is not available Cluster operator machine-api is not available Cluster operator authentication is not available Cluster operator image-registry is not available Cluster operator ingress is not available Cluster operator monitoring is not available Cluster operator openshift-apiserver is not available Could not update rolebinding "openshift/cluster-samples-operator-openshift-edit" (536 of 937): resource may have been deleted Could not update oauthclient "console" (597 of 937): the server does not recognize this resource, check extension API servers Could not update imagestream "openshift/driver-toolkit" (659 of 937): resource may have been deleted Could not update role "openshift/copied-csv-viewer" (727 of 937): resource may have been deleted Could not update role "openshift-console-operator/prometheus-k8s" (855 of 937): resource may have been deleted Could not update role "openshift-console/prometheus-k8s" (859 of 937): resource may have been deleted] I0402 17:19:20.235037 1 sync_worker.go:1166] Update error 108 of 937: UpdatePayloadResourceTypeMissing Could not update etcd "cluster" (108 of 937): the server does not recognize this resource, check extension API servers (*errors.withStack: failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1") * Could not update etcd "cluster" (108 of 937): the server does not recognize this resource, check extension API servers
- links to
-
RHEA-2024:0041 OpenShift Container Platform 4.16.z bug fix update