Type: Bug
Resolution: Done-Errata
Priority: Critical
Fix Version/s: 4.16
Affects Version/s: 4.16
Component/s: Etcd
Labels:
None

Severity:
Important
Regression:
No
Release Blocker:
Proposed
Blocked:
False
Blocked Reason:
None
Release Note Type:
Release Note Not Required
Release Note Status:
In Progress
Target Version:

4.16.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Description of problem:

Altinfra build jobs are failing

Version-Release number of selected component (if applicable):

4.16

How reproducible:

Always

Steps to Reproduce:

1. Build the installer from master and use the latest 4.16 nightly release image.
2. Run the CAPI-enabled installer with FeatureSet CustomNoUpgrade and featureGates: ["ClusterAPIInstall=true"].
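
The settings in step 2 can be checked programmatically before running the installer. The following is a minimal sketch, not part of the installer: it assumes the install-config has already been parsed into a dictionary with top-level `featureSet` and `featureGates` keys, and the helper name `capi_install_enabled` is hypothetical.

```python
def capi_install_enabled(install_config: dict) -> bool:
    """Return True if the config requests CustomNoUpgrade with ClusterAPIInstall=true.

    Assumption: featureSet and featureGates are top-level keys of the parsed
    install-config, as written in the reproduction steps.
    """
    if install_config.get("featureSet") != "CustomNoUpgrade":
        return False
    gates = install_config.get("featureGates") or []
    return "ClusterAPIInstall=true" in gates


# Values taken from the reproduction steps above.
example = {
    "featureSet": "CustomNoUpgrade",
    "featureGates": ["ClusterAPIInstall=true"],
}
print(capi_install_enabled(example))
```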

Actual results:

Cluster fails to complete bootstrap

Expected results:

Cluster is able to install completely

Additional info:

This bug tracks the investigation into why the altinfra e2e jobs were failing for:
https://prow.ci.openshift.org/job-history/gs/test-platform-results/pr-logs/directory/pull-ci-openshift-installer-master-altinfra-e2e-vsphere-capi-ovn
On investigation, the etcd operator was not being created. We saw the following:

CVO:

E0402 17:18:59.959209       1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1"
E0402 17:19:03.862993       1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1"
E0402 17:19:09.157126       1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1"
I0402 17:19:20.234944       1 task_graph.go:550] Result of work: [Could not update etcd "cluster" (108 of 937): the server does not recognize this resource, check extension API servers Cluster operator kube-apiserver is not available Cluster operator machine-api is not available Cluster operator authentication is not available Cluster operator image-registry is not available Cluster operator ingress is not available Cluster operator monitoring is not available Cluster operator openshift-apiserver is not available Could not update rolebinding "openshift/cluster-samples-operator-openshift-edit" (536 of 937): resource may have been deleted Could not update oauthclient "console" (597 of 937): the server does not recognize this resource, check extension API servers Could not update imagestream "openshift/driver-toolkit" (659 of 937): resource may have been deleted Could not update role "openshift/copied-csv-viewer" (727 of 937): resource may have been deleted Could not update role "openshift-console-operator/prometheus-k8s" (855 of 937): resource may have been deleted Could not update role "openshift-console/prometheus-k8s" (859 of 937): resource may have been deleted]
I0402 17:19:20.235037       1 sync_worker.go:1166] Update error 108 of 937: UpdatePayloadResourceTypeMissing Could not update etcd "cluster" (108 of 937): the server does not recognize this resource, check extension API servers (*errors.withStack: failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1")
* Could not update etcd "cluster" (108 of 937): the server does not recognize this resource, check extension API servers
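
When triaging similar failures, it can help to summarize which resource kinds the CVO could not resolve. The sketch below is illustrative only: the regular expression is derived from the log lines quoted above rather than from any documented CVO log format, and the names `MISSING_KIND` and `missing_kinds` are hypothetical.

```python
import re
from collections import Counter

# Pattern inferred from the "error running apply" lines quoted in this report.
MISSING_KIND = re.compile(
    r'error running apply for (?P<resource>\S+) "(?P<name>[^"]+)" '
    r'\((?P<index>\d+) of (?P<total>\d+)\): failed to get resource type: '
    r'no matches for kind "(?P<kind>[^"]+)" in version "(?P<version>[^"]+)"'
)


def missing_kinds(log_text: str) -> Counter:
    """Count (kind, apiVersion) pairs the CVO reported as unknown to the API server."""
    counts = Counter()
    for line in log_text.splitlines():
        match = MISSING_KIND.search(line)
        if match:
            counts[(match["kind"], match["version"])] += 1
    return counts


# Two of the log lines from this report, copied verbatim.
sample = '''E0402 17:19:03.862993       1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1"
E0402 17:19:09.157126       1 task.go:124] error running apply for etcd "cluster" (108 of 937): failed to get resource type: no matches for kind "Etcd" in version "operator.openshift.io/v1"'''

for (kind, version), count in missing_kinds(sample).items():
    print(f"{kind} ({version}): {count} failed apply attempts")
```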

links to

openshift/cluster-etcd-operator#1233: OCPBUGS-31658: Fix feature set custom no upgrade

RHEA-2024:0041 OpenShift Container Platform 4.16.z bug fix update

Assignee: Neil Girard

Reporter: Neil Girard

QA Contact: Ge Liu

Votes: 0

Watchers: 5

Created: 2024/04/02 7:53 PM

Updated: 2024/06/27 11:42 AM

Resolved: 2024/06/27 11:42 AM
