-
Task
-
Resolution: Unresolved
-
Critical
-
None
-
None
-
False
-
-
False
-
Unset
-
No
-
-
Description of Problem
In the past 12 hours, there has been a significant increase in lag, accompanied by a failure to report this lag to Prometheus. This unreported lag is exacerbating the overall lag issue. It is crucial to resolve this problem before the General Availability (GA) release to ensure optimal performance and accurate monitoring. Please investigate and address this reporting failure promptly.
Ref Link :
https://redhat-internal.slack.com/archives/C04B6CJ3A81/p1707908870245839
How reproducible
Steps to Reproduce
NA
Actual Behavior
In the past 12 hours, there has been a significant increase in system lag, and failures to report this lag to Prometheus have been observed. This failure to report is contributing to the overall lag issue, impacting system performance and monitoring accuracy.
Expected Behavior
System performance should remain stable without significant increases in lag. Any instances of lag should be accurately reported to Prometheus to ensure proper monitoring and timely resolution of issues. There should be no failures in reporting lag to Prometheus, ensuring all performance data is captured correctly.