Loading...

XML

Word

Printable

Type: Story
Resolution: Unresolved
Priority: Minor
Fix Version/s: None
Affects Version/s: rhel-9.2.0
Component/s: kernel / File Systems / GFS-GFS2
Labels:
- MigratedToJIRA
- Reopened

Epic Link:
RHELPLAN-37280

Pool Team:

rhel-sst-filesystems
Sub-System Group:

ssg_filesystems_storage_and_HA

Story Points:
5
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Product Documentation Required:
None
Sprint:
None

Preliminary Testing:
None
Test Coverage:
None

Release Note Type:
If docs needed, set a value

Experience:
Architecture:

Unspecified
Bugzilla Bug:
RHBZ: 1962191

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Planning:
None

GFS2 currently doesn't allow a glock to be taken by the same task more than once; trying to take the same glock a second time leads to a BUG in add_to_queue(). As a workaround, gfs2_glock_is_locked_by_me() is used in several places to check if the glock is held already, followed by checks like gfs2_glock_is_held_excl().

The problem with that is that gfs2_glock_is_held_excl checks gl->gl_state, so when a glock is held in LM_ST_SHARED or LM_ST_DEFERRED state by the current task, the node may still have the glock cached in LM_ST_EXCLUSIVE state. In that situation, checking for gfs2_glock_is_held_excl() doesn't ensure that the glock will remain locked in LM_ST_EXCLUSIVE state. A possible workaround would be to "upgrade" the current holder (which is returned by gfs2_glock_is_locked_by_me()) to LM_ST_EXCLUSIVE. However, the lock would then remain locked in that state longer than necessary. A better solution would be to recognize the self-recursion and to allow the task to hold the glock a second time. In a LM_FLAG_TRY situation, we could even try to upgrade the glock.

This would reduce the number of gfs2_glock_is_locked_by_me() exceptions in the code. I'm also convinced that gfs2_glock_is_locked_by_me() and gfs2_glock_is_held_*() are used in unsafe ways; reworking that would allow us to properly clean that up.

Examples are the locking in gfs2_update_time() and the retries in gfs2_fault() and gfs2_page_mkwrite() which would sometimes be avoidable.

external trackers

Red Hat Issue Tracker RHELPLAN-80035

Assignee:: GFS2 Maintainers Bot

Reporter:: Andreas Gruenbacher

Developer:: GFS2 Maintainers Bot

QA Contact:: Cluster QE

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2023/09/22 9:07 PM

Updated:: 2024/01/09 4:24 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates