Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: 4.19.z
Affects Version/s: 4.16, 4.16.z
Component/s: Storage / Kubernetes
Labels:
- FUSE
- plugin

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Moderate
Regression:
None

Target Backport Versions:
None
Target Version:

4.19.z
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
PX Priority Data:
PX Impact Score:
PX Technical Impact:
PX Impact Range:

Release Note Status:
Done
Release Note Type:
Release Note Not Required
Release Note Text:
N/A

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

The kubelet crashes across clusters, caused by a `fatal error: concurrent map iteration and map write`. This crash has been traced to the FindPluginBySpec() function in the VolumePluginMgr, impacting FlexVolume operations (particularly with the ibmc-s3fs plugin). These crashes lead to stale PV mounts and application disruptions due to FUSE processes not being cleaned up.

In reference to a Kubernetes upstream issue #124839 and its fix PR #129755 , which match the observed behavior. Logs from affected nodes confirm the panic message, and it aligns with known FlexVolume activity during DaemonSet rollouts.

~~~
Jun 30 16:38:04 popular-reptile-x-large-wdc-containers-nonprod1 kubenswrapper[6186]: fatal error: concurrent map iteration and map write Jun 30 16:38:04 popular-reptile-x-large-wdc-containers-nonprod1 kubenswrapper[6186]: goroutine 280507699 [running]: Jun 30 16:38:04 popular-reptile-x-large-wdc-containers-nonprod1 kubenswrapper[6186]: k8s.io/kubernetes/pkg/volume.(*VolumePluginMgr).FindPluginBySpec(0xc0003c4d08, 0xc0079b04e0) Jun 30 16:38:04 popular-reptile-x-large-wdc-containers-nonprod1 kubenswrapper[6186]: k8s.io/kubernetes/pkg/volume/plugins.go:683 +0x327

~~~

As per the slack discussion [1] creating this Bug.

The customer is seeking clarity on whether the upstream fix will be backported to any supported OpenShift 4.x version.

[1] https://redhat-internal.slack.com/archives/CK1AE4ZCK/p1753114688440409

Actual results:

The customer is seeking clarity on whether the upstream fix will be backported to any supported OpenShift 4.x version.

Expected results:

Additional info:

blocks

OCPBUGS-65494 [release-4.18] Kubelet crash - fatal error: concurrent map iteration and map write

Closed

is blocked by

OCPBUGS-64770 [release-4.20] Kubelet crash - fatal error: concurrent map iteration and map write

Closed

is cloned by

OCPBUGS-64770 [release-4.20] Kubelet crash - fatal error: concurrent map iteration and map write

Closed

OCPBUGS-65494 [release-4.18] Kubelet crash - fatal error: concurrent map iteration and map write

Closed

links to

openshift/kubernetes#2509: OCPBUGS-59651: Fix Concurrentmap Iteration

Assignee:: Maxim Patlasov

Reporter:: Roopa R

QA Contact:: Wei Duan

Need Info From:: Jitendar Singh

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: 2025/07/22 4:50 PM

Updated:: 2025/11/19 5:08 AM

Resolved:: 2025/11/19 5:08 AM

Details

Description

Description of problem:

Actual results:

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates