Description of problem:
The aws-efs-csi-driver-operator loses connectivity to the underlying EFS filesystem. Based on upstream GitHub reports, this may be caused by a memory leak in stunnel that causes the process to die. The issue appears to be addressed in efs-utils v1.34.2, which uses stunnel v5.58. Threads are linked in the additional information section below.
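As a first check on an affected node, it is worth confirming whether the stunnel process spawned by efs-utils is still running and how much memory it has accumulated. A minimal sketch, assuming cluster-admin access and that the EFS CSI node pods run in the standard openshift-cluster-csi-drivers namespace; `<node-name>` is a placeholder:

```
# Find which nodes run the EFS CSI node driver (namespace per the default
# OCP CSI driver layout; adjust if the operator was installed elsewhere).
oc get pods -n openshift-cluster-csi-drivers -o wide | grep efs

# On an affected node, check whether stunnel is alive and how much resident
# memory it holds; a steadily growing RSS would be consistent with the
# suspected leak, and no output at all means the process has died.
oc debug node/<node-name> -- chroot /host ps -o pid,rss,etime,cmd -C stunnel
```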
Version-Release number of selected component (if applicable):
Cluster Version: 4.10.34
Operator Version: 4.10.0-202211041323
stunnel Version: stunnel-5.56-5.el8_3.x86_64
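For reference, the versions above can be collected with standard oc commands. A sketch, assuming the operator's CSV lives in the same openshift-cluster-csi-drivers namespace and that the node DaemonSet uses the default pod and container names; the pod name is a placeholder:

```
# Cluster version
oc get clusterversion version

# Operator version from its CSV (namespace assumed; adjust to your install)
oc get csv -n openshift-cluster-csi-drivers | grep -i efs

# RPM versions inside a driver node pod; pod and container names are
# assumptions based on the default DaemonSet layout.
oc exec -n openshift-cluster-csi-drivers <aws-efs-csi-driver-node-pod> \
  -c csi-driver -- rpm -q stunnel amazon-efs-utils
```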
How reproducible:
Sporadic in nature; there is no reliable reproducer.
Actual results:
The mount fails with the kernel error `nfs: server 127.0.0.1 not responding, still trying`, occasionally causing the pod to lose access to the storage. The server address is 127.0.0.1 because efs-utils tunnels NFS traffic through a local stunnel listener, so a dead stunnel process makes the NFS server appear unresponsive.
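Because that message comes from the kernel NFS client, it appears in the node's kernel log rather than in pod logs. A sketch for confirming it on an affected node; `<node-name>` is a placeholder:

```
# Search the node's kernel ring buffer for the NFS client error.
oc debug node/<node-name> -- chroot /host sh -c "dmesg -T | grep 'not responding'"

# Correlate with the stunnel-backed mounts themselves; TLS mounts via
# efs-utils point at a 127.0.0.1 listener.
oc debug node/<node-name> -- chroot /host sh -c "mount | grep 127.0.0.1"
```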
Expected results:
NFS mount stays connected and filesystem is accessible in pods.
Additional info:
Upstream aws-efs-csi-driver issue: https://github.com/kubernetes-sigs/aws-efs-csi-driver/issues/616
Upstream AWS efs-utils issue: https://github.com/aws/efs-utils/issues/99#issuecomment-1326960406
We appear to be based on aws-efs-utils v1.34.1 and to ship stunnel v5.56 in the UBI8 image: https://github.com/openshift/aws-efs-utils/blob/release-4.10/amazon-efs-utils.spec#L34
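The packaged versions can be confirmed by querying the RPM database of the shipped image directly. A sketch, assuming local podman access with pull credentials; the image pullspec is a placeholder for the image actually deployed in-cluster:

```
# Query the RPM database of the EFS CSI driver image; the pullspec is a
# placeholder, not the real image reference.
podman run --rm --entrypoint rpm <efs-csi-driver-image-pullspec> \
  -q amazon-efs-utils stunnel
```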
Cloned by: OCPBUGS-7813 [release-4.12] [AWS EFS] NFS mount disconnects and becomes unavailable. (Closed)