-
Bug
-
Resolution: Unresolved
-
Major
-
None
-
None
-
8
-
False
-
None
-
False
-
GITOPS Service EE Sprint 3254, GitOps Scarlet - Sprint 2261, GitOps Scarlet - Sprint 3257, GitOps Scarlet - Sprint 3259, GitOps Scarlet - Sprint 3260, GitOps Scarlet - Sprint 3262, GitOps Scarlet - Sprint 3263, GitOps Scarlet - Sprint 3264
Application sync may fail under rare conditions because the openshift-gitops-application-controller-0 pod uses /dev/shm as its temporary directory. Since /dev/shm is limited to 64M (this cannot be changed in OpenShift Container Platform 4), it may eventually fill up and report an error similar to the one below, which causes the application sync to fail.
Failed sync attempt to 306419123d21df53cbeeef7c289gf4531d579j7h: failed to initialize sync context: failed to write kubeconfig: write /dev/shm/3227598765: no space left on device
The filesystem usage within the pod looked as follows.
overlay     120G   56G   65G   47%  /
tmpfs        64M     0   64M    0%  /dev
tmpfs        40G     0   40G    0%  /sys/fs/cgroup
shm          64M   64M     0  100%  /dev/shm   <<-- 100% full
tmpfs        40G  121M   40G    1%  /etc/passwd
/dev/sda4   120G   56G   65G   47%  /etc/hosts
tmpfs        40G     0   40G    0%  /app/config/controller/tls
tmpfs        40G   44K   40G    1%  /run/secrets/kubernetes.io/serviceaccount
Once the openshift-gitops-application-controller-0 pod is re-created or restarted, /dev/shm is reset as well, so application sync may start working again. However, /dev/shm can fill up again and cause the same issue to recur.
It is therefore required to change the temporary directory from /dev/shm to a more suitable location, ideally one whose size can be adjusted. A hedged sketch of what such a change could look like follows below.
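As a minimal sketch only (not the actual GitOps or Argo CD implementation): the controller could prefer an explicit override such as the TMPDIR environment variable before falling back to /dev/shm, so an administrator can point scratch files at a writable, resizable location (for example an emptyDir volume mounted into the pod). The helper name tempDir and the preference order below are illustrative assumptions.

package main

import (
	"fmt"
	"os"
)

// tempDir returns the directory to use for scratch files such as temporary
// kubeconfigs. Illustrative preference order (an assumption, not the shipped
// behavior):
//   1. An explicit TMPDIR override, which can be backed by a resizable volume.
//   2. /dev/shm, only if it exists and is a directory (mirrors the current
//      behavior, capped at 64M on OpenShift Container Platform 4).
//   3. The Go default temporary directory.
func tempDir() string {
	if dir := os.Getenv("TMPDIR"); dir != "" {
		return dir
	}
	if info, err := os.Stat("/dev/shm"); err == nil && info.IsDir() {
		return "/dev/shm"
	}
	return os.TempDir()
}

func main() {
	// Example: write a throwaway kubeconfig into the selected directory.
	f, err := os.CreateTemp(tempDir(), "kubeconfig-*")
	if err != nil {
		fmt.Fprintln(os.Stderr, "failed to write kubeconfig:", err)
		os.Exit(1)
	}
	defer os.Remove(f.Name())
	fmt.Println("temporary kubeconfig written to", f.Name())
}

With such an override in place, the controller pod could mount an emptyDir volume and set TMPDIR to its mount path, avoiding the fixed 64M limit of /dev/shm.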
- relates to
-
GITOPS-5664 openshift-gitops-application-controller-0 pod crashes with OOMKILLED
- Closed
- links to