Bug | Resolution: Done | Normal | RHODS_1.26.0_GA | 2 | False | None | False | Testable | No | 1.28.0 | No | No | Pending | None | ML Serving Sprint 1.28, ML Serving Sprint 1.29
Description of problem:
An existing installation of RHODS had its `odh-model-controller` pods break completely after upgrading to OCP 4.13. The root cause appears to be the clusterrole in use (`manager-role`): it is a generic example name that RHACM also uses, which may have caused the conflict during the RHODS upgrade.
RHODS should use a uniquely named clusterrole to avoid further collisions.
More details: https://redhat-internal.slack.com/archives/C03UGJY6Z1A/p1684747070780669
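For triage, a minimal set of checks (a sketch; it assumes the default RHODS applications namespace `redhat-ods-applications`, adjust if the install uses a different one):

```
# Confirm the symptom: odh-model-controller pods restarting / crash-looping
oc get pods -n redhat-ods-applications | grep odh-model-controller

# Inspect the shared clusterrole; labels/annotations usually reveal which
# operator last applied it
oc get clusterrole manager-role -o yaml | grep -A 10 -E 'labels:|annotations:'

# List the bindings that reference the role, to see which components depend on it
oc get clusterrolebinding -o wide | grep -w manager-role
```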
Prerequisites (if any, like setup, operators/versions):
RHODS + RHACM (or any other project that uses the same clusterrole)
Steps to Reproduce:
Unclear, but an upgrade to RHACM likely overrode the clusterrole that the `odh-model-controller` pods are using.
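One way to check whether another operator overrode the role is to look at the server-side field managers recorded on the object (a sketch; field-manager names vary by operator, so this only supports the theory, it does not prove it):

```
# Show which controllers ("field managers") have written to the clusterrole;
# an RHACM-owned manager appearing alongside the RHODS one would point to a collision
oc get clusterrole manager-role -o yaml --show-managed-fields \
  | grep -E 'manager:|operation:|time:'
```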
Actual results:
`odh-model-controller` pods keep restarting
Expected results:
RHODS components are unaffected by other projects that ship a clusterrole with the same name
Reproducibility (Always/Intermittent/Only Once):
Happened only once that we know of, but it can likely happen again.
Build Details:
Workaround:
Additional info: