OpenShift Workloads / WRKLDS-1277

Consider terminating pods in job controller


    • Type: Epic
    • Resolution: Done
    • Priority: Normal
    • Status: In Progress
    • Parent: OCPSTRAT-1115 - Consider terminating pods in job controller
    • Progress: 0% To Do, 0% In Progress, 100% Done
    • Size: S

      As an OpenShift administrator

      1. I want to implement a pod failure policy in the job controller so that terminating pods (those with a deletionTimestamp) are not immediately replaced and do not count as failed until they reach a terminal phase (Failed or Succeeded). This ensures more efficient and accurate handling of pod failures.
      2. I want to avoid creating replacement pods for pods that are still terminating but have not yet reached the Failed or Succeeded phase, so that I can prevent unnecessary resource allocation and align replacement-pod creation with the pod failure policy (see the Job manifest sketch after this list).
      3. I want to extend the kubelet to mark terminating pods that are still Pending as Failed, so that their transition into the Failed phase is clearer and more consistent, improving pod lifecycle management overall.
      4. I want to add a DisruptionTarget condition to pods preempted by the kubelet to make room for critical pods, so that pods disrupted for critical-workload prioritization are easier to observe and manage (see the status excerpts after the KEP links).
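
      The first two stories map onto the podReplacementPolicy field that KEP-3939 adds to the batch/v1 Job API. Below is a minimal sketch of a Job using it, assuming the JobPodReplacementPolicy feature gate is enabled (beta and on by default since Kubernetes 1.29); the job name, image, and command are illustrative:

          apiVersion: batch/v1
          kind: Job
          metadata:
            name: example-job            # illustrative name
          spec:
            backoffLimit: 3
            # Create a replacement only once the previous pod is fully
            # terminated (Failed), not while it is merely terminating
            # (deletionTimestamp set).
            podReplacementPolicy: Failed
            # When a pod failure policy is set, Failed is the only allowed
            # (and default) replacement policy; this rule keeps
            # kubelet-initiated disruptions from counting against backoffLimit.
            podFailurePolicy:
              rules:
              - action: Ignore
                onPodConditions:
                - type: DisruptionTarget
            template:
              spec:
                restartPolicy: Never     # required when podFailurePolicy is set
                containers:
                - name: main
                  image: busybox:1.36    # illustrative image
                  command: ["sh", "-c", "sleep 30"]

      Without podReplacementPolicy: Failed, the controller's default behavior (TerminatingOrFailed) recreates pods as soon as they begin terminating, which is exactly what the first two stories aim to avoid when a pod failure policy is in play.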

       

      KEP: https://github.com/kubernetes/enhancements/tree/master/keps/sig-apps/3939-allow-replacement-when-fully-terminated
      KEP issue: https://github.com/kubernetes/enhancements/issues/3939
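
      For the kubelet-side stories, KEP-3939 also surfaces what the controller sees: the Job status gains a terminating counter, and disrupted pods carry a DisruptionTarget condition. The excerpts below are illustrative sketches; all counts, the reason, and the message are hypothetical examples, with the reason following the TerminationByKubelet value the kubelet uses for disruptions it initiates:

          # Job status excerpt: pods that are shutting down but not yet
          # terminal are counted separately from active/failed pods
          # (values illustrative).
          status:
            active: 1
            terminating: 1
            failed: 0
            succeeded: 0

          # Pod status excerpt after the kubelet preempts the pod to admit
          # a critical pod (reason and message illustrative).
          status:
            phase: Failed
            conditions:
            - type: DisruptionTarget
              status: "True"
              reason: TerminationByKubelet
              message: Pod was preempted to accommodate a critical pod.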

            People:
              Filip Krepinsky (fkrepins@redhat.com)
              Jan Chaloupka (jchaloup@redhat.com)
              Rama Kasturi Narra
            Votes: 0
            Watchers: 1
