OpenShift Request For Enhancement / RFE-1425

Provide ability to horizontally scale Prometheus

Details

    • Feature Request
    • Resolution: Done
    • Normal
    • Monitoring

    Description

      1. Proposed title of this feature request
      Provide ability to horizontally scale Prometheus

      2. What is the nature and description of the request?
      Today a single Prometheus instance scrapes every endpoint, so the number of endpoints and series that can be collected is bounded by what one instance can handle. This request is for a way to address that limitation so that the resources, especially memory, required by any single Prometheus instance stay reasonable.
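
      One common way to bound per-instance memory is to split scrape targets across several Prometheus instances by hashing each target's address, which is the idea behind Prometheus's `hashmod` relabel action. The sketch below only illustrates that partitioning idea. The function names, the example addresses, and the shard count of 3 are assumptions made for illustration, and the hash used here is not claimed to match Prometheus's own; it does not describe how this request was resolved.

```python
import hashlib


def shard_for(target: str, num_shards: int) -> int:
    """Map a scrape target address to a shard index in [0, num_shards)."""
    digest = hashlib.md5(target.encode("utf-8")).digest()
    # Interpret the last 8 bytes of the digest as an unsigned integer,
    # then reduce it modulo the number of shards.
    return int.from_bytes(digest[8:], "big") % num_shards


def partition(targets, num_shards):
    """Group targets by the shard (hypothetical Prometheus instance) that would scrape them."""
    shards = {i: [] for i in range(num_shards)}
    for target in targets:
        shards[shard_for(target, num_shards)].append(target)
    return shards


# Hypothetical example: 300 node-level endpoints split across 3 instances.
targets = [f"10.0.{i // 256}.{i % 256}:9100" for i in range(300)]
shards = partition(targets, 3)
for index, members in shards.items():
    print(f"shard {index}: {len(members)} targets")
```

      Because the assignment depends only on the target address and the shard count, each instance can decide independently which targets it owns. Each instance then holds only its own share of the series.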

      3. Why does the customer need this? (List the business requirements here)
      The customer runs a large cluster (300 nodes, all large bare-metal servers) that supports many jobs being created at the same time. Prometheus is currently configured with 500GB of memory and still gets OOM-killed from time to time when job pods go into CrashLoopBackOff for whatever reason.

      4. List any affected packages or components.
      Monitoring/Prometheus

            People

              rh-ee-rfloren Roger Florén
              rhn-support-fgiloux Frederic Giloux (Inactive)
              Votes: 2
              Watchers: 18
