- Bug
- Resolution: Unresolved
- Normal
- 4.14
Description of problem:
The OpenShift Dedicated (OSD) Policy Responsibility Matrix documentation states that Red Hat is responsible for monitoring the utilization of customer resources, including Network, Storage, and Compute capacity. According to the documentation, Red Hat is expected to alert customers when their worker nodes reach full capacity if autoscaling is not enabled. The specific statement is as follows:
"Red Hat responsibilities: Monitor utilization of customer resources including Network, Storage, and Compute capacity. Where autoscaling features are not enabled alert customer for any changes required to cluster resources (for example, new compute nodes to scale, additional storage, etc)."
However, this is inaccurate for OSD clusters. Currently, compute node monitoring and alerting are provided only for Red Hat OpenShift Service on AWS (ROSA) clusters. OSD clusters do not receive alerts for worker node capacity issues when autoscaling is disabled.
This discrepancy confuses customers and sets incorrect expectations about Red Hat’s monitoring responsibilities in OSD clusters when autoscaling is not enabled.
Expected results:
The documentation should be updated to clearly specify the monitoring and alerting capabilities and limitations for OSD clusters.
Additional Information:
The above information is confirmed in https://issues.redhat.com/browse/OHSS-37751