Type: Bug
Resolution: Unresolved
Priority: Undefined
Fix Version/s: None
Affects Version/s: rhos-18.0 FR 1 (Nov 2024)
Component/s: nova-operator
Labels:
None

Story Points:
1
Blocked:
False
Blocked Reason:

None
Ready:
False
Docs Approval:
?
Regression:
None
Intelligence Requested:
Market:

Sprint:
Compute Next Sprint Candidates
sprint_count:
1
Severity:
Moderate

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

This was noticed during a code review of the cell deletion logic.

https://github.com/openstack-k8s-operators/nova-operator/blame/c7cb16745a8c54a36a4862900434df9a493d1198/controllers/nova_controller.go#L719-L780

Here the cell deletion logic starts the job that deletes the cell mapping from the cell0 DB, but it never waits for the job to finish. It moves on to delete the NovaCell CR representing the cell and also removes the cell from the Nova.status.RegisteredCells list.
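
The missing wait can be pictured as a small decision step that runs before the NovaCell CR is touched. The sketch below is not the nova-operator code: JobState, Action and nextDeletionAction are hypothetical names invented for illustration, and the job status is reduced to three states. It only shows the ordering that the linked pull request title describes, where the CR is deleted only after the deletion job has passed.

```go
package main

import "fmt"

// JobState is a hypothetical, simplified view of the cell mapping deletion job.
type JobState int

const (
	JobRunning JobState = iota
	JobSucceeded
	JobFailed
)

// Action is what one reconcile pass decides to do for a cell that is being deleted.
// These names are assumptions for this sketch, not nova-operator identifiers.
type Action string

const (
	// ActionRequeue: the job has not finished, keep the NovaCell CR and check again later.
	ActionRequeue Action = "requeue"
	// ActionDeleteCell: the job succeeded, the CR can be deleted and the cell unregistered.
	ActionDeleteCell Action = "delete-cell"
	// ActionReportError: the job failed, keep the CR and surface the failure.
	ActionReportError Action = "report-error"
)

// nextDeletionAction gates the NovaCell deletion on the outcome of the
// cell mapping deletion job instead of deleting the CR right after the job starts.
func nextDeletionAction(state JobState) Action {
	switch state {
	case JobSucceeded:
		return ActionDeleteCell
	case JobFailed:
		return ActionReportError
	default:
		return ActionRequeue
	}
}

func main() {
	for _, s := range []JobState{JobRunning, JobSucceeded, JobFailed} {
		fmt.Printf("job state %d -> %s\n", s, nextDeletionAction(s))
	}
}
```

With this ordering the cell stays in Nova.status.RegisteredCells until the mapping is really gone, so a re-created cell with the same name cannot overlap with a still-pending deletion job.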

I expect this opens a race window in the following scenario:

1. The user deletes cell2.
2. nova-operator starts the cell2 cell mapping deletion job and deletes NovaCell/cell2.
3. The job is slow to be scheduled to a worker, or slow to run because the cell0 DB is slow.
4. The user decides to (re)create cell2 as a new cell. (Maybe the user deleted cell2 because it failed somehow and wants to retry the cell creation by deleting and re-creating it.)
5. nova-operator creates the new NovaCell/cell2 and eventually starts the cell mapping job.
6. Now the cell mapping deletion job and the cell mapping job for the same cell name (cell2) run in parallel. If the cell mapping job runs first, it sees and updates the existing mapping, and then the cell mapping deletion job simply removes that mapping.

The result is a cell that is ready from the nova-operator perspective but unmapped from the OpenStack perspective.
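
The two possible orderings can be replayed with a toy model. This is only a sketch under stated assumptions: cellMappings is a hypothetical in-memory stand-in for the cell mappings in the cell0 DB, and mapCell and unmapCell stand in for the cell mapping job and the cell mapping deletion job. None of these names come from nova-operator or nova.

```go
package main

import "fmt"

// cellMappings is a hypothetical in-memory stand-in for the cell mappings in the cell0 DB.
type cellMappings map[string]bool

// mapCell models the cell mapping job: create the mapping, or update it if it already exists.
func (m cellMappings) mapCell(name string) { m[name] = true }

// unmapCell models the cell mapping deletion job.
func (m cellMappings) unmapCell(name string) { delete(m, name) }

// racyOrder replays the scenario above: the deletion job of the old cell2
// runs after the mapping job of the re-created cell2.
func racyOrder() bool {
	db := cellMappings{"cell2": true} // mapping left over from the old cell2
	db.mapCell("cell2")               // new cell2: sees and updates the existing mapping
	db.unmapCell("cell2")             // delayed deletion job removes it
	return db["cell2"]
}

// serializedOrder models the case where the deletion job has finished
// before the new cell is mapped.
func serializedOrder() bool {
	db := cellMappings{"cell2": true}
	db.unmapCell("cell2") // deletion job completes first
	db.mapCell("cell2")   // new cell2 is mapped afterwards
	return db["cell2"]
}

func main() {
	fmt.Println("mapped after racy order:      ", racyOrder())       // false: cell2 ends up unmapped
	fmt.Println("mapped after serialized order:", serializedOrder()) // true: cell2 stays mapped
}
```

In the racy order the operator would still report the new cell2 as ready, because its own mapping job succeeded, while the mapping itself is gone.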

links to

openstack-k8s-operators/nova-operator#917: Delete NovaCell cr only when deletion job pass

Assignee: Kamil Sambor

Reporter: Balazs Gibizer

Team: rhos-dfg-compute

Votes: 0

Watchers: 3

Created: 2024/12/20 10:10 AM

Updated: 2025/02/28 12:17 PM
