Uploaded image for project: 'OpenShift Logging'
  1. OpenShift Logging
  2. LOG-2682

Fluent collector labeling metrics with path name.

XMLWordPrintable

    • False
    • None
    • False
    • NEW
    • VERIFIED
    • Hide
      Before this update, this collector would include path as a label for the metrics it produced. This path changes frequently and contributed to significant storage changes for the Prometheus server. With this update, the label has been dropped to resolve the issue and reduce storage consumption.
      Show
      Before this update, this collector would include path as a label for the metrics it produced. This path changes frequently and contributed to significant storage changes for the Prometheus server. With this update, the label has been dropped to resolve the issue and reduce storage consumption.
    • Logging (Core) - Sprint 219, Logging (Core) - Sprint 220
    • Critical
    • Medium (50%-79%) [It is possible this will become an issue]
    • Customer Facing

      Problem statement

      The fluent collector is labelling metrics with the path of the log file, which creates too many time series and can overload prometheus.

      Background

      This used to be ok when CRIO used /var/log/containers and used a single file name for the life of the log file.

      After switching to /var/log/pods, CRIO now generates a series of numbered files, with the latest logs at the highest number.

      This means that including the file label multiples the number of time series by the number of file rotations, which overloads prometheus.

      Solution

      Remove the path label. It is no longer needed, and should have been removed when we switched to /var/log/pods

              jcantril@redhat.com Jeffrey Cantrill
              rhn-engineering-aconway Alan Conway
              Anping Li Anping Li
              Votes:
              2 Vote for this issue
              Watchers:
              8 Start watching this issue

                Created:
                Updated:
                Resolved:

                  Estimated:
                  Original Estimate - 1 minute
                  1m
                  Remaining:
                  Remaining Estimate - 1 minute
                  1m
                  Logged:
                  Time Spent - Not Specified
                  Not Specified