Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:

Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
OVN As Default CI Coverage
[QE] How to address?:
---

Cost of Delay:
0
WSJF:
0

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

job link

must-gather

snippet from test output:

{  4 events happened too frequently

event happened 22 times, something is wrong: ns/openshift-cluster-storage-operator deployment/csi-snapshot-controller-operator - reason/OperatorStatusChanged Status for clusteroperator/csi-snapshot-controller changed: Progressing message changed from "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods" to "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods\nCSISnapshotWebhookControllerProgressing: desired generation 2, current generation 1"
event happened 24 times, something is wrong: ns/openshift-cluster-storage-operator deployment/csi-snapshot-controller-operator - reason/OperatorStatusChanged Status for clusteroperator/csi-snapshot-controller changed: Progressing message changed from "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods\nCSISnapshotWebhookControllerProgressing: desired generation 2, current generation 1" to "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods"
event happened 23 times, something is wrong: ns/openshift-cluster-storage-operator deployment/csi-snapshot-controller-operator - reason/OperatorStatusChanged Status for clusteroperator/csi-snapshot-controller changed: Progressing message changed from "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods\nCSISnapshotWebhookControllerProgressing: 1 out of 2 pods running" to "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods\nCSISnapshotWebhookControllerProgressing: desired generation 2, current generation 1"
event happened 23 times, something is wrong: ns/openshift-cluster-storage-operator deployment/csi-snapshot-controller-operator - reason/OperatorStatusChanged Status for clusteroperator/csi-snapshot-controller changed: Progressing message changed from "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods\nCSISnapshotWebhookControllerProgressing: desired generation 2, current generation 1" to "CSISnapshotControllerProgressing: Waiting for Deployment to deploy csi-snapshot-controller pods\nCSISnapshotWebhookControllerProgressing: 1 out of 2 pods running"}

This failure is coming from a check that "[sig-arch] events should not repeat pathologically". Essentially looking for troubling events
that occur more than X (I think it's 20) number of times. This particular issue around CSISnapshotWebhookControllerProgressing seems
to happen in our ovn-upgrade job periodicially. I only saw one case of it happening in an openshift-sdn job and that was in a
slightly more complicated upgrade-rollback job. I think this one is worth chasing down since it seems someone affected by
ovn.

There was a bug matching this kind of problem filed back in March but it has since been
marked RESOLVED so I don't think anyone is actively looking at this any more.

here's a search.ci link that shows all the jobs that have this problem for our aws-ovn-upgrade jobs over the last 7 days.

link to this job's testgrid for reference.

is duplicated by

SDN-2990 failure in test cases for the etcdHighNumberOfFailedGRPCRequests alert

Closed

Assignee:: Jamo Luhrsen

Reporter:: Jamo Luhrsen

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2022/05/12 9:46 PM

Updated:: 2022/07/27 8:59 PM

Resolved:: 2022/07/27 8:59 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates