Uploaded image for project: 'Network Observability'
  1. Network Observability
  2. NETOBSERV-2626

FlowCollector: improve visibility on degraded states

    • Icon: Story Story
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • None
    • FLP
    • None
    • Future Sustainability
    • False
    • Hide

      None

      Show
      None
    • False
    • None
    • None
    • None
    • None

      Conditions

      Create a new condition in FlowCollector that sets status to degraded when there are many pod restarts (agents, FLP, plugin) in the last X hours (6 hours?)

      Make it also degraded by detecting "infinite reconcile loops" (e.g. the FlowCollector reconcile being triggered more than X times (10? 20?) in the last N minutes (5?))

      Alert

      Create an operational metric+alert on status conditions (to check: maybe that metric already exists via the OLM metrics?)

              Unassigned Unassigned
              jtakvori Joel Takvorian
              None
              None
              None
              None
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: