Uploaded image for project: 'OpenShift Logging'
  1. OpenShift Logging
  2. LOG-6238

Implement alert for failed object store flushes

XMLWordPrintable

    • Icon: Task Task
    • Resolution: Done
    • Icon: Normal Normal
    • Logging 6.4.0
    • Logging 6.0.0, Logging 6.1.0, Logging 6.2.0, Logging 6.3.0, Logging 6.4.0
    • Log Storage
    • None
    • Log Storage - Sprint 261, Log Storage - Sprint 262, Log Storage - Sprint 263, Log Storage - Sprint 268, Logging - Sprint 275, Logging - Sprint 276, Logging - Sprint 277, Logging - Sprint 278

      A common but not immediately obvious way to misconfigure LokiStack is to provide it with a secret that is syntactically correct but invalid for accessing the object storage.

      This error will only become visible, once Loki tries to interact with the object storage, for example when the ingester tries to push new chunks into it.

      A once valid secret might also become invalid over time if it expires or the account is deactivated.

      Loki 3.1 now added a metric showing the number of failed chunk flushes: loki_ingester_chunks_flush_failures_total

      We should be able to create an alert based on that metric that fires when there are flush failures and alerts the operator if this issue persists for a while indicating a failure with either the object storage itself or the secret.

        1. Screenshot 2025-10-12 at 7.57.29 PM.png
          204 kB
          Kabir Bharti
        2. Screenshot 2025-10-12 at 7.56.02 PM.png
          256 kB
          Kabir Bharti
        3. Screenshot 2025-10-12 at 7.55.54 PM.png
          307 kB
          Kabir Bharti

              jmarcal@redhat.com Joao Marcal
              rojacob@redhat.com Robert Jacob
              Kabir Bharti Kabir Bharti
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: