Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-76627

KAS Fails to Reload Certificates on ARO-HCP (INT)

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • 4.20.z
    • HyperShift / ARO
    • None
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • Important
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      In the Integration (INT) environment, the Kube API Server (KAS) is failing to reload certificates specifically during the creation of breakglass credentials. While the certificate rotation controller successfully generates the new breakglass secrets, the KAS does not pick them up, rendering the emergency credentials unusable. 

      Version-Release number of selected component (if applicable):

          4.20.8

      How reproducible:

          Trigger Event: The failure occurs during the generation of breakglass credentials.Controller Action: The rotation controller correctly reconciles and produces the new certificate secrets.
      
      KAS Behavior: The KAS remains stale; it does not acknowledge the new certificates or reload the secret mount, causing the INT tests to fail.
      
      Environmental Context: This is isolated to the INT environment. Both STG and PROD are handling the breakglass workflow correctly.

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

      We observe on ARO-HCP, we do not see the KAS reload certificates, even though the certificate rotation controller produces new ones.

      Expected results:

      We need for on ARO-HCP that in INT, need to see the KAS reload certificates, with the new produced certificate.

      Additional info:

      We have attached a must-gather for the HyperShift team to analyze the communication between the management cluster and the hosted control plane:
      
      PKI Operator: Logs confirming the creation of the breakglass secret.
      
      KAS: Logs and pod volume mount status to determine if the secret update was propagated.    

              Unassigned Unassigned
              rhn-engineering-sirkal Sajeel Irkal
              He Liu He Liu
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: