Uploaded image for project: 'OpenShift Logging'
  1. OpenShift Logging
  2. LOG-6238

Implement alert for failed object store flushes

XMLWordPrintable

    • Icon: Task Task
    • Resolution: Done
    • Icon: Normal Normal
    • Logging 6.4.0
    • Logging 6.0.0, Logging 6.1.0, Logging 6.2.0, Logging 6.3.0, Logging 6.4.0
    • Log Storage
    • None
    • Log Storage - Sprint 261, Log Storage - Sprint 262, Log Storage - Sprint 263, Log Storage - Sprint 268, Logging - Sprint 275, Logging - Sprint 276, Logging - Sprint 277, Logging - Sprint 278

      A common but not immediately obvious way to misconfigure LokiStack is to provide it with a secret that is syntactically correct but invalid for accessing the object storage.

      This error will only become visible, once Loki tries to interact with the object storage, for example when the ingester tries to push new chunks into it.

      A once valid secret might also become invalid over time if it expires or the account is deactivated.

      Loki 3.1 now added a metric showing the number of failed chunk flushes: loki_ingester_chunks_flush_failures_total

      We should be able to create an alert based on that metric that fires when there are flush failures and alerts the operator if this issue persists for a while indicating a failure with either the object storage itself or the secret.

              jmarcal@redhat.com Joao Marcal
              rojacob@redhat.com Robert Jacob
              Kabir Bharti Kabir Bharti
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: