Red Hat Data Grid: JDG-7637

GracefulShutdown upgrades should tolerate pods that have already been stopped


      Problem

      During a GracefulShutdown the Operator does the following:

      1. Ping the server to determine the version so that we can create the correct client
      2. Disable global rebalancing via the REST endpoint by calling the -0 pod
      3. For each pod in the cluster, call the shutdown endpoint

      If the Operator's progress is interrupted* during step 3, subsequent attempts to perform the GracefulShutdown can fail because the pod's cache-container has already been shut down.

      Furthermore, the pod list is not guaranteed to be in the same order on each attempt, which introduces further non-determinism.

      *Progress may be interrupted by the Operator pod being restarted or rescheduled, or by an unexpected error from the server.
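The three steps above can be sketched as a single sequential procedure. This is a minimal illustration, not the Operator's actual code: the `Client` interface and all function names are hypothetical stand-ins for the REST calls described above.

```go
package main

import "fmt"

// Client abstracts the REST calls the Operator performs during a
// GracefulShutdown. All names here are hypothetical illustrations,
// not the Operator's actual API.
type Client interface {
	ServerVersion() (string, error)      // step 1: ping to create the correct client
	DisableRebalancing() error           // step 2: REST call against the -0 pod
	ShutdownContainer(pod string) error  // step 3: per-pod shutdown endpoint
}

// gracefulShutdown performs the three steps in order. Note that any
// per-pod error aborts the loop, so a retry re-runs step 3 from the
// beginning against pods that may already be stopped -- the failure
// mode this issue describes.
func gracefulShutdown(c Client, pods []string) error {
	if _, err := c.ServerVersion(); err != nil {
		return fmt.Errorf("unable to determine server version: %w", err)
	}
	if err := c.DisableRebalancing(); err != nil {
		return fmt.Errorf("unable to disable rebalancing: %w", err)
	}
	for _, pod := range pods {
		if err := c.ShutdownContainer(pod); err != nil {
			return fmt.Errorf("pod %s: shutdown failed: %w", pod, err)
		}
	}
	return nil
}

// fakeClient is a test double that records which pods were shut down.
type fakeClient struct{ stopped []string }

func (f *fakeClient) ServerVersion() (string, error) { return "14.0", nil }
func (f *fakeClient) DisableRebalancing() error      { return nil }
func (f *fakeClient) ShutdownContainer(pod string) error {
	f.stopped = append(f.stopped, pod)
	return nil
}

func main() {
	f := &fakeClient{}
	if err := gracefulShutdown(f, []string{"infinispan-0", "infinispan-1"}); err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("stopped:", f.stopped)
}
```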

      Solution

      1. When attempting the GracefulShutdown, we should continue to the next pod if a pod returns an error response indicating that its cache-container has already been stopped, and emit an appropriate log message stating that the pod was already stopped.
      2. We should make sure that all error logs associated with pod-specific requests include the name of the pod, to ease future debugging.
      3. We should ensure that the order of pod names returned by ctx.InfinispanPods() is deterministic, sorted from the lowest to highest ordinal.
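The three solution points can be sketched together: a deterministic sort by pod ordinal, and a shutdown loop that skips already-stopped pods while naming the pod in every log and error. This is a hedged sketch, not the Operator's implementation; `errContainerStopped` is a hypothetical sentinel standing in for the server's "already stopped" error response, and the real code would inspect the REST response instead.

```go
package main

import (
	"errors"
	"fmt"
	"sort"
	"strconv"
	"strings"
)

// errContainerStopped stands in for the server's error response when its
// cache-container has already been shut down (hypothetical sentinel).
var errContainerStopped = errors.New("cache-container already stopped")

// sortByOrdinal orders StatefulSet pod names (name-0, name-1, ...) from
// lowest to highest ordinal, making the shutdown order deterministic
// regardless of how the pod list was produced.
func sortByOrdinal(pods []string) {
	ordinal := func(name string) int {
		n, _ := strconv.Atoi(name[strings.LastIndex(name, "-")+1:])
		return n
	}
	sort.Slice(pods, func(i, j int) bool { return ordinal(pods[i]) < ordinal(pods[j]) })
}

// shutdownPods shuts down every pod in ordinal order, continuing past
// pods whose cache-container is already stopped and including the pod
// name in every log and error message.
func shutdownPods(pods []string, shutdown func(pod string) error) error {
	sortByOrdinal(pods)
	for _, pod := range pods {
		switch err := shutdown(pod); {
		case err == nil:
		case errors.Is(err, errContainerStopped):
			fmt.Printf("pod %s already shutdown, skipping\n", pod)
		default:
			return fmt.Errorf("pod %s: shutdown failed: %w", pod, err)
		}
	}
	return nil
}

func main() {
	// Simulate a retry where infinispan-0 was stopped on a previous attempt.
	pods := []string{"infinispan-2", "infinispan-0", "infinispan-1"}
	err := shutdownPods(pods, func(pod string) error {
		if pod == "infinispan-0" {
			return errContainerStopped
		}
		return nil
	})
	fmt.Println("pods:", pods, "err:", err)
}
```

With this shape, a retried GracefulShutdown walks the same pod order every time and treats "already stopped" as progress rather than failure.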

              Ryan Emerson (remerson@redhat.com)
              Alan Field (rhn-support-afield)
              Pavel Drobek
