Epic
Resolution: Unresolved
nova-operator CI flakiness
To Do
Root Cause Analysis
rhos-workloads-compute
100% To Do, 0% In Progress, 0% Done
nova-operator PRs require too many rechecks to pass CI because of flaky jobs. PRs hit failures unrelated to their code changes, which wastes CI resources.
PRs that should pass fail intermittently and need multiple `/recheck` or `/retest` commands, with no code changes, before they eventually merge. This pattern is worsening and now affects the majority of PRs.
A few examples:
- https://github.com/openstack-k8s-operators/nova-operator/pull/1052
- https://github.com/openstack-k8s-operators/nova-operator/pull/1055
- https://github.com/openstack-k8s-operators/nova-operator/pull/1066
- https://github.com/openstack-k8s-operators/nova-operator/pull/1065
- https://github.com/openstack-k8s-operators/nova-operator/pull/1054
Most common failing job: openstack-meta-content-provider
Goal:
- Investigate and identify the root causes of the CI job failures
- Document and fix them
- Or improve the error messages so it is clear whether a failure comes from the code change or from something else
Overall, reduce the need for empty rechecks.
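To track whether empty rechecks actually go down, one option is to count recheck commands per PR. The following is only a minimal sketch: it assumes the PR comment bodies have already been fetched as plain strings, and the function name `count_rechecks` is hypothetical, not part of any existing tooling.

```python
import re
from typing import Iterable

# Matches a comment line that starts with /recheck or /retest
# (leading whitespace allowed). Assumption: these are the only
# commands used to re-trigger CI on the PRs being measured.
RECHECK_RE = re.compile(r"^\s*/(recheck|retest)\b", re.MULTILINE)


def count_rechecks(comment_bodies: Iterable[str]) -> int:
    """Count comments that contain at least one recheck/retest command."""
    return sum(1 for body in comment_bodies if RECHECK_RE.search(body))
```

Comparing this count per PR before and after a fix lands would give a rough signal, though it cannot tell an empty recheck apart from one issued after a genuine infrastructure outage.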
Acceptance Criteria:
- team experience
- Zuul build dashboard
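One way to make the dashboard criterion measurable is a per-job failure rate computed from build records. This is a sketch under assumptions: each record is taken to be a mapping with `job_name` and `result` keys, as one might export from a build listing, and `failure_rate_by_job` is a hypothetical helper. Every result other than `SUCCESS` counts as a failure here, so genuine code-related failures are included too.

```python
from collections import Counter
from typing import Dict, Iterable, Mapping


def failure_rate_by_job(builds: Iterable[Mapping[str, str]]) -> Dict[str, float]:
    """Return the fraction of non-SUCCESS builds for each job name."""
    total: Counter = Counter()
    failed: Counter = Counter()
    for build in builds:
        job = build["job_name"]  # assumed key name
        total[job] += 1
        if build["result"] != "SUCCESS":  # assumed key name and value
            failed[job] += 1
    return {job: failed[job] / total[job] for job in total}
```

Tracking this number over time for the most frequently failing job would show whether a fix helped, independent of how many rechecks people happened to issue.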