Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: None
Affects Version/s: Pipelines 1.19.0
Component/s: Tekton Results
Labels:

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Release Note Type:
Release Note Not Required
Intelligence Requested:
Market:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Description of problem:

When Tekton Results becomes overloaded, for example when a surge of 5-6x the normal volume of PLRs completes Results can get into a state where the workqueue is so large that it is unable to process new object creation before the objects are deleted. It is unable to add finalizers to TaskRuns and PipelineRuns before they have completed and been pruned. Because now almost every queued event cannot be processed, Results appears to get into a state where it tries to reconcile every object, fails in a permanent way, but still attempts to retry the reconciliation after some time. This results in the workqueue being "low", and reconciliation latency being "low", but reconciliation success rate being extremely poor
All of these thousands of stale reconciliations are not invisibly stored in the retry queue, even though their k8s objects have long since been deleted.

Recovery for this is straightforward but manual: restart the pod. Results needs to be able to recover from this properly however. If an object no longer exists in the cluster, we shouldn't keep retrying to reconcile it.

Prerequisites (if any, like setup, operators/versions):

Steps to Reproduce

# <steps>

Actual results:

Expected results:

Reproducibility (Always/Intermittent/Only Once):

Acceptance criteria:

Definition of Done:

Build Details:

Additional info (Such as Logs, Screenshots, etc):

*

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

image-2025-06-27-14-56-51-366.png
26 kB
2025/06/27 6:56 PM
image-2025-06-27-14-59-56-963.png
192 kB
2025/06/27 6:59 PM
image-2025-06-27-15-03-25-125.png
103 kB
2025/06/27 7:03 PM

Assignee:: Unassigned

Reporter:: Andrew Thorp

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2025/06/27 7:06 PM

Updated:: 2025/08/15 1:26 PM

Resolved:: 2025/08/15 1:25 PM

Details

Description

Description of problem:

Prerequisites (if any, like setup, operators/versions):

Steps to Reproduce

Actual results:

Expected results:

Reproducibility (Always/Intermittent/Only Once):

Acceptance criteria:

Build Details:

Additional info (Such as Logs, Screenshots, etc):

*

Attachments

Attachments

Easy Agile Planning Poker

Activity

People

Dates