Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: rhwa-25.9
Affects Version/s: rhwa-25.7
Component/s: Self Node Remediation
Labels:
None

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Release Note Text:

Hide
Cause: Sometimes gathering IP addresses of peers temporarily fails for single nodes. The resulting error was bubbled up to the SNR agent start.
Consequence: The SNR agent did not start because of such an error.
Fix: Catch the error, and retry gathering IP addresses of peers instead.
Result: Temporary errors don't stop SNR agent start anymore.

Show
Cause: Sometimes gathering IP addresses of peers temporarily fails for single nodes. The resulting error was bubbled up to the SNR agent start. Consequence: The SNR agent did not start because of such an error. Fix: Catch the error, and retry gathering IP addresses of peers instead. Result: Temporary errors don't stop SNR agent start anymore.
Release Note Type:
Bug Fix
Release Note Status:
Proposed
Intelligence Requested:
Market:

Target Version:

rhwa-25.9

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

When a host machine is gracefully placed into maintenance mode (with NMO) and subsequently turned off, the other SNR pods running on the remaining, operational cluster hosts begin to crash.

This issue seems related to a sync failure with the turned off peer. The pods continue to crash until the offline node is successfully brought back online and its respective SNR pod is operational again

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

RHWA-382-4.20-connected-snr-nmo-12-nov.text
2025/11/12 9:47 AM
28 kB
vipin kumar

links to

medik8s/self-node-remediation#271: Add unit test to verify that Start() never fails for peer IP gathering issues

Assignee:: Marc Sluiter

Reporter:: Carlo Lobrano

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2025/10/23 7:21 AM

Updated:: 2025/11/24 8:35 AM

Resolved:: 2025/11/17 9:08 PM

Details

Description

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty