Loading...

XML

Word

Printable

Type: Task
Resolution: Unresolved
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: CI
Labels:
- maintenance

Activity Type:
Quality / Stability / Reliability
Epic Link:
ROX-25640
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Intelligence Requested:
Market:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Overview:

There have been a couple of CI failures for the scanner-v4-install-tests test suite (e.g. ROX-28514), which are caused by temporary unavailability of the Kube API server during `helm install` invocations.

In such a situation `helm install` fails as follows:

INFO: Wed Mar 12 10:18:00 UTC 2025: [deploy-stackrox] Error: rendered manifests contain a resource that already exists. Unable to continue with install: could not get information about the resource ClusterRoleBinding "stackrox:review-tokens-binding" in namespace "": an error on the server ("Internal Server Error: \"/apis/rbac.authorization.k8s.io/v1/clusterrolebindings/stackrox:review-tokens-binding\": the server is currently unable to handle the request") has prevented the request from succeeding (get clusterrolebindings.rbac.authorization.k8s.io stackrox:review-tokens-binding)

Apparently `helm` does not currently have built-in functionality for doing retries automatically. Therefore we might be forced to write our own helm CLI wrapper, similar to the `retry-kubectl.sh` script, which we are using already in CI.

Assignee:: Unassigned

Reporter:: Moritz Clasmeier

Team:: ACS Install

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2025/03/19 1:09 PM

Updated:: 2025/07/01 2:10 PM

Details

Description

Overview:

Attachments

Easy Agile Planning Poker

Activity

People

Dates