OpenShift Pipelines / SRVKP-8330

Increase in PipelineRun and TaskRun durations in Pipelines v1.19 with HA enabled

    • Ranked Issues, Pipelines Sprint Pioneers 39

      Description of problem:

      With High Availability (HA) enabled, both PipelineRun (PR) and TaskRun (TR) durations are noticeably higher in version 1.19 than in 1.18. The increase is consistent across all tested concurrency levels (10, 20, 30, 40). The TaskRun-to-pod-creation metric shows a similar increase, while the workqueue depth of the Pipelines Controller decreased in 1.19, which warrants attention.

       

      Test Setup:

      • HA Setup: Enabled
      • Tested Concurrency Levels: 10, 20, 30, 40
      • Metrics: Average duration (in seconds) for PR and TR executions.

      Observed Results:

      Version  Concurrency  TR Duration (s)  PR Duration (s)
      1.18     10             8.0261           11.1365
      1.18     20            16.6893           36.288
      1.18     30            29.4091           67.608
      1.18     40            45.494           100.4822
      1.19     10            10.2609           17.31
      1.19     20            31.9666           65.015
      1.19     30            51.4273          109.257
      1.19     40            70.0653          151.947
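
      To make the size of the slowdown easier to compare across concurrency levels, the relative increase from 1.18 to 1.19 can be derived from the averages in the table. The following is a minimal sketch in Python. The names `DURATIONS`, `percent_increase`, and `regression_table` are illustrative and not part of any Pipelines tooling; the numbers are copied from the table, nothing else is measured.

```python
# Average durations in seconds, copied from the Observed Results table.
# Outer key: Pipelines version. Inner key: concurrency level.
# Value: (TaskRun duration, PipelineRun duration).
DURATIONS = {
    "1.18": {
        10: (8.0261, 11.1365),
        20: (16.6893, 36.288),
        30: (29.4091, 67.608),
        40: (45.494, 100.4822),
    },
    "1.19": {
        10: (10.2609, 17.31),
        20: (31.9666, 65.015),
        30: (51.4273, 109.257),
        40: (70.0653, 151.947),
    },
}


def percent_increase(old: float, new: float) -> float:
    """Relative change from old to new, expressed in percent."""
    return (new - old) / old * 100.0


def regression_table(baseline: str = "1.18", candidate: str = "1.19"):
    """Return (concurrency, TR increase %, PR increase %) per concurrency level."""
    rows = []
    for concurrency in sorted(DURATIONS[baseline]):
        tr_old, pr_old = DURATIONS[baseline][concurrency]
        tr_new, pr_new = DURATIONS[candidate][concurrency]
        rows.append(
            (
                concurrency,
                percent_increase(tr_old, tr_new),
                percent_increase(pr_old, pr_new),
            )
        )
    return rows


if __name__ == "__main__":
    print("Concurrency  TR increase (%)  PR increase (%)")
    for concurrency, tr_pct, pr_pct in regression_table():
        print(f"{concurrency:<11}  {tr_pct:>15.1f}  {pr_pct:>15.1f}")
```

      On these averages, the TR increase ranges from roughly 28% (concurrency 10) to roughly 92% (concurrency 20), and the PR increase from roughly 51% (concurrency 40) to roughly 79% (concurrency 20).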

       

      Impact:

      • Increased latency in TaskRun and PipelineRun execution in HA setup for 1.19.
      • Potential performance regression compared to 1.18.
      • Could affect scalability and throughput in production environments.

      Expected Behavior:
      PR/TR durations should remain consistent or improve with newer versions under similar load and HA configurations.
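
      One way to turn this expectation into an automated check is a simple tolerance gate on the average durations. The sketch below is illustrative only: the function name `within_tolerance` and the 10% default tolerance are assumptions made for this example, not values agreed in this issue.

```python
def within_tolerance(baseline_s: float, candidate_s: float, tolerance: float = 0.10) -> bool:
    """Return True if the candidate duration is no more than
    (1 + tolerance) times the baseline duration.

    The 10% default is a hypothetical threshold chosen for illustration.
    """
    if baseline_s <= 0:
        raise ValueError("baseline duration must be positive")
    return candidate_s <= baseline_s * (1.0 + tolerance)


if __name__ == "__main__":
    # TR averages at concurrency 10 from the Observed Results: 1.18 vs 1.19.
    print(within_tolerance(8.0261, 10.2609))  # False: 1.19 exceeds the gate
```

      A faster or equal candidate always passes, so the gate only flags slowdowns beyond the chosen tolerance.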

      Pipelines Controller - HA

      Report: https://docs.google.com/document/d/156iBOAZA6DRd7m8C1WMHSeHL83HhIqG1V1_8HHtTliU/edit?tab=t.w8jk5w8z8qgt
      ProwJob:

      • 1.19 (Right in chart):

      Prerequisites (if any, like setup, operators/versions):

      Steps to Reproduce

      Trigger the jobs in CI release with Prow commands, for example:

       

      /pj-rehearse pull-ci-openshift-pipelines-performance-main-max-concurrency-downstream-1-19-1000-x-math-ha-10

      Actual results:

      Expected results:

      Reproducibility (Always/Intermittent/Only Once):

      Acceptance criteria: 

       

      Definition of Done:

      Build Details:

      Additional info (Such as Logs, Screenshots, etc):

       

        1. Pipeline controller workqueue depth.png (44 kB)
        2. PR_duration.png (68 kB)
        3. Successful_PR_metrics.png (75 kB)
        4. Successfull_TR_Duration.png (73 kB)
        5. TR_duration.png (75 kB)
        6. TR_POD.png (44 kB)

              rh-ee-vbobade Vibhav Bobade
              rh-ee-sira Siddardh R A