-
Bug
-
Resolution: Done
-
Blocker
-
RHOAI_2.8.0, RHOAI_2.8.1, RHOAI_2.9.0
What
DSC keeps reconciling the components. It seems happening when codeflare and/or kueue and/or ray are enabled
UPDATE: from latest tests on RHOAI 2.8.1 and 2.8.0, I think it happens when kueue and ray are enabled together
Steps to reproduce
- install RHOAI 2.8.1 or 2.9.0 (we haven't checked previous releases, but it could be present even before)
- create DSC enaling kueue and ray (or all of the components)
- wait until all the components get deployed in redhat-ods-applications namespace
- check DSC: still reconciling
- disable the above components
- wait for components pod to be removed
- check DSC: not reconciling anymore
Additional info
- rhods-operator logs: rhods-operator-57cdf87f74-2ln4d- (1).log
- must-gather: must-gather.local.4928626708380685603.zip
- Unable to reproduce with 2.9.0 RC 1 so far
Environment
- IBM Cloud with RHOAI 2.9 pre-RC iib:704539 and OCP 4.15.8
- PSI Cluster (OpenStack) with RHOAI 2.8.1 from OperatorHub (stable channel) and OCP 4.15.8
- is caused by
-
RHOAIENG-5501 Kuberay creates opendatahub namespace when enabled
- Resolved
-
RHOAIENG-6348 Kueue creates opendatahub namespace when enabled
- Resolved
- is cloned by
-
RHOAIENG-6276 [2.8.z] Infinite reconciliation loop of DataScienceCluster (DSC)
- Closed