-
Bug
-
Resolution: Unresolved
-
Undefined
-
rhos-18.0.z
-
rhos-ops-platform-services-pidone
-
Important
Bug impact
- In RHOSO 18, some services run with a single replica (replicas: 1), for example cinder-scheduler, nova-conductor, and others.
For these single-replica services, the availability of the OCP worker node that hosts them is crucial to keeping the service running.
However, some situations, for example a node failure or shutting down a node without first running cordon/drain, cause a service interruption of more than 5 minutes.
To mitigate these issues, we have a document, Monitoring high availability services[1].
The doc[1] should be read by all users, because a service interruption of 5 minutes or more is generally unacceptable.
So, we should add a link to [1] in Deploying Red Hat OpenStack Services on OpenShift[2] as recommended next reading, for a better user experience.
[1] https://docs.redhat.com/en/documentation/red_hat_openstack_services_on_openshift/18.0/html-single/monitoring_high_availability_services/index#assembly_monitoring-high-availability-services
[2] https://docs.redhat.com/en/documentation/red_hat_openstack_services_on_openshift/18.0/html-single/deploying_red_hat_openstack_services_on_openshift/index
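
To gauge which services are exposed to the interruption described under Bug impact, a rough sketch like the following could list the workloads that run with a single replica. It is only an illustration, not something taken from the docs above. It assumes that the services run as Deployments or StatefulSets, that the control plane lives in a namespace named openstack, and that the oc client is logged in to the cluster. The function names are hypothetical.

```python
import json
import subprocess


def single_replica_workloads(resource_list: dict) -> list[str]:
    """Return 'Kind/name' for every workload whose spec.replicas is 1."""
    found = []
    for item in resource_list.get("items", []):
        # Kubernetes defaults spec.replicas to 1 when the field is omitted.
        replicas = item.get("spec", {}).get("replicas", 1)
        if replicas == 1:
            found.append(f"{item['kind']}/{item['metadata']['name']}")
    return sorted(found)


def fetch_workloads(namespace: str = "openstack") -> dict:
    """Query the cluster with `oc`; the namespace name is an assumption."""
    out = subprocess.run(
        ["oc", "get", "deployments,statefulsets", "-n", namespace, "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return json.loads(out)
```

Calling single_replica_workloads(fetch_workloads()) on a live cluster returns the names to review. The parsing is kept separate from the oc call so that it can also be run against saved JSON output.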