Uploaded image for project: 'OpenShift Logging'
  1. OpenShift Logging
  2. LOG-2682

Fluent collector labeling metrics with path name.

    XMLWordPrintable

Details

    • False
    • None
    • False
    • NEW
    • VERIFIED
    • Hide
      Before this update, this collector would include path as a label for the metrics it produced. This path changes frequently and contributed to significant storage changes for the Prometheus server. With this update, the label has been dropped to resolve the issue and reduce storage consumption.
      Show
      Before this update, this collector would include path as a label for the metrics it produced. This path changes frequently and contributed to significant storage changes for the Prometheus server. With this update, the label has been dropped to resolve the issue and reduce storage consumption.
    • Logging (Core) - Sprint 219, Logging (Core) - Sprint 220
    • Critical
    • Medium (50%-79%) [It is possible this will become an issue]
    • Customer Facing

    Description

      Problem statement

      The fluent collector is labelling metrics with the path of the log file, which creates too many time series and can overload prometheus.

      Background

      This used to be ok when CRIO used /var/log/containers and used a single file name for the life of the log file.

      After switching to /var/log/pods, CRIO now generates a series of numbered files, with the latest logs at the highest number.

      This means that including the file label multiples the number of time series by the number of file rotations, which overloads prometheus.

      Solution

      Remove the path label. It is no longer needed, and should have been removed when we switched to /var/log/pods

      Attachments

        Activity

          People

            jcantril@redhat.com Jeffrey Cantrill
            rhn-engineering-aconway Alan Conway
            Anping Li Anping Li
            Votes:
            2 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 1 minute
                1m
                Remaining:
                Remaining Estimate - 1 minute
                1m
                Logged:
                Time Spent - Not Specified
                Not Specified