Loading...

XML

Word

Printable

As an OpenShift administrator

I want to implement a pod failure policy in the job controller so that terminating pods (with a deletionTimestamp) are not immediately replaced and don't count as failed until they reach a terminal phase (Failed or Succeeded). This ensures a more efficient and accurate handling of pod failures.
I want to avoid creating replacement pods for pods that are in the process of terminating but have not yet reached a Failed or Succeeded state so that I can prevent unnecessary resource allocation and align the creation of replacement pods with the pod failure policy.
I want to extend Kubelet to mark pending terminating pods as failed so that the transition of these pods into the Failed phase is clearer and more consistent, enhancing the overall management of pod lifecycles.
I want to add a DisruptionTarget condition for pods preempted by Kubelet to make room for critical pods so that there is better visibility and management of pods that are disrupted for critical workload prioritization.

relates to

RFE-2328 FailedScheduling event in specific environments with two replicas