OpenShift Bugs / OCPBUGS-54382

Azure stack: storage azure disk csi driver node pods CrashLoopBackOff


      Description of problem:

      The storage operator is degraded on 4.19 installs to Azure Stack:

      storage                                    4.19.0-0.ci.test-2025-03-26-183004-ci-ln-q7lhwk2-latest   False       True          False      76s     AzureDiskCSIDriverOperatorCRAvailable: AzureDiskDriverNodeServiceControllerAvailable: Waiting for the DaemonSet to deploy the CSI Node Service
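      The azure-disk-csi-driver-node DaemonSet pods in openshift-cluster-csi-drivers are in CrashLoopBackOff. A quick way to see the state (the DaemonSet name is inferred from the pod name in the logs below):

      # oc get daemonset azure-disk-csi-driver-node -n openshift-cluster-csi-drivers
      # oc get pods -n openshift-cluster-csi-drivers | grep azure-disk-csi-driver-node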
      
      

      Pod logs show a permission error:

      [root@fedora auth]# oc logs azure-disk-csi-driver-node-6c44h -n openshift-cluster-csi-drivers csi-driver | tail -n 1
      E0331 00:57:40.711688       1 utils.go:110] GRPC error: rpc error: code = Internal desc = getNodeInfoFromLabels on node(padillon03271650-4x84l-worker-mtcazs-r6584) failed with get node(padillon03271650-4x84l-worker-mtcazs-r6584) failed with nodes "padillon03271650-4x84l-worker-mtcazs-r6584" is forbidden: User "system:serviceaccount:openshift-cluster-csi-drivers:azure-disk-csi-driver-node-sa" cannot get resource "nodes" in API group "" at the cluster scope 
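      The denial can be confirmed directly with impersonation (oc auth can-i is a standard subcommand; the service account name is taken from the error above). While the bug is present this should print "no":

      # oc auth can-i get nodes --as=system:serviceaccount:openshift-cluster-csi-drivers:azure-disk-csi-driver-node-sa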

      I did not track down the source of this call to get nodes. Checking the RBAC:

      # oc describe clusterrole azure-disk-privileged-role
      Name:         azure-disk-privileged-role
      Labels:       <none>
      Annotations:  <none>
      PolicyRule:
        Resources                                         Non-Resource URLs  Resource Names  Verbs
        ---------                                         -----------------  --------------  -----
        securitycontextconstraints.security.openshift.io  []                 [privileged]    [use] 
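      To enumerate which subjects do hold that permission, and which ClusterRoles exist for the driver, something like the following should work (both are standard oc subcommands):

      # oc policy who-can get nodes
      # oc get clusterrole -o name | grep azure-disk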

      Not sure what to make of this. Perhaps an upstream change? It could also be Azure Stack weirdness; more context below.
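      As a diagnostic-only workaround (the role/binding name below is made up, and the storage operator may reconcile manual RBAC changes away), granting the node service account read access to nodes should unblock the DaemonSet:

      # oc create clusterrole azure-disk-node-nodes-reader --verb=get,list,watch --resource=nodes
      # oc create clusterrolebinding azure-disk-node-nodes-reader --clusterrole=azure-disk-node-nodes-reader --serviceaccount=openshift-cluster-csi-drivers:azure-disk-csi-driver-node-sa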

       

      Version-Release number of selected component (if applicable):

          4.19ec3

      How reproducible:

          Always

      Steps to Reproduce:

      Install 4.19 on Azure Stack; all Azure Stack installs hit this.
          

      Actual results:

          Degraded operator

      Expected results:

          Available

      Additional info:

      1. https://issues.redhat.com/browse/OCPBUGS-51090 tracks an upstream bug in cloud-provider-azure, for which I have an upstream WIP fix: https://github.com/kubernetes-sigs/cloud-provider-azure/pull/8755. Upstream switched the cloud provider SDK to the v2 implementation, which still has spotty (at best) support for Azure Stack.
      
      If the storage operator depends on node labels, this cloud provider bug could be the cause. 
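      One way to check whether the cloud provider populated the labels the driver reads (an assumption on my part that getNodeInfoFromLabels reads topology/instance-type labels off the Node object; the node name comes from the error log above):

      # oc get node padillon03271650-4x84l-worker-mtcazs-r6584 --show-labels | tr ',' '\n' | grep -Ei 'topology|instance-type'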

       

      2. CI is down because new security measures were put in place for our environment. Manual token validation is now required. They are meeting on Monday about enabling access from the fixed IP address we have given them.

       

      Must-gather attached.

              Assignee: Penghao Wang (rhn-support-pewang)
              Reporter: Patrick Dillon (padillon)