Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Normal
Fix Version/s: 4.22.0
Affects Version/s: 4.16, 4.17, 4.18, 4.19, 4.20, 4.21
Component/s: kube-apiserver
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Moderate
Regression:
None

Target Backport Versions:
None
Target Version:

4.22
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
PX Impact Score:

Release Note Status:
Proposed
Release Note Type:
Bug Fix
Release Note Text:
Static pod pruner no longer accidentally deletes certificates when their associated cluster name contains ".tmp"

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

When installing a cluster that contains the substring ".tmp" in the domain name, the static pod pruner will delete all certificates on all control plane nodes.

Version-Release number of selected component (if applicable):

any version containing https://github.com/openshift/library-go/pull/1103

cluster-etcd-operator has it all the way back to 4.8:
https://github.com/openshift/cluster-etcd-operator/blob/release-4.8/vendor/github.com/openshift/library-go/pkg/operator/staticpod/prune/cmd.go#L138-L142

How reproducible:

always

Steps to Reproduce:

    1. Create a cluster with a subdomain that contains .tmp, e.g. test.tmpgcp.devcluster.openshift.com
    2. Wait for installation and potentially force a few static pod rollouts on etcd/apiserver by changing the log level

Actual results:

the cluster annihaliates itself by deleting all certificates in the kubernetes manifest dir for all static pods

Expected results:

the cluster installs fine and continues to run stable

Additional info:

I already wrote a regression test for it here that showcases it being a problem:
https://github.com/openshift/library-go/pull/2025/files

links to

openshift/cluster-etcd-operator#1526: OCPBUGS-62422: deps: Update library-go to update pruner

openshift/cluster-kube-apiserver-operator#1994: OCPBUGS-62422: deps: Update library-go to update pruner

openshift/cluster-kube-controller-manager-operator#895: OCPBUGS-62422: dep: Update library-go to update pruner

openshift/cluster-kube-scheduler-operator#593: OCPBUGS-62422: deps: Update library-go to update pruner

openshift/library-go#2053: OCPBUGS-62422: staticpod/prune: Remove tmp cert file pruning

Assignee:: Ondřej Kupka

Reporter:: Thomas Jungblut

Need Info From:: None

Contributors:: None

QA Contact:: Ke Wang

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Created:: 2025/09/30 8:08 AM

Updated:: 2026/01/16 7:21 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates