-
Story
-
Resolution: Unresolved
-
Undefined
-
None
-
None
-
None
-
Future Sustainability
-
False
-
-
False
-
None
-
None
-
None
-
None
Conditions
Create a new condition in FlowCollector that sets status to degraded when there are many pod restarts (agents, FLP, plugin) in the last X hours (6 hours?)
Make it also degraded by detecting "infinite reconcile loops" (e.g. the FlowCollector reconcile being triggered more than X times (10? 20?) in the last N minutes (5?))
Alert
Create an operational metric+alert on status conditions (to check: maybe that metric already exists via the OLM metrics?)