Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-65713

HC could't be ready when HostedControlPlane is 4.21, NodePool is 4.18./4.19

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Critical Critical
    • None
    • 4.21
    • HyperShift
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • Critical
    • None
    • None
    • Approved
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      NodePool has skew with HostedControlPlane, the np is using 4.18 or 4.19 while hcp is in 4.21, hc could't be ready, since service-ca-operator fails due to UserNamespacesSupport feature gate mismatch. That caused many clusteroperator failed with error  such as 'tls secret not found'.
          

      Version-Release number of selected component (if applicable):

      4.21
          

      How reproducible:

      always
          

      Steps to Reproduce:

          1.Render hc resources to yaml file when creating 4.21 hostedcluster 
          2.Update the np release image to 4.18 or 4.19 version, and apply all the resources.
          3.Check np and hcp
          

      Actual results:
      Step3 : np is ready, but hc couldn't be ready since many clusteroperator failed

      oc get hc -A
      NAMESPACE   NAME                  VERSION   KUBECONFIG                             PROGRESS   AVAILABLE   PROGRESSING   MESSAGE
      clusters    wxj-18-nodeskew-421             wxj-18-nodeskew-421-admin-kubeconfig   Partial    True        False         The hosted control plane is available
      oc get np -A
      NAMESPACE   NAME                             CLUSTER               DESIRED NODES   CURRENT NODES   AUTOSCALING   AUTOREPAIR   VERSION                              UPDATINGVERSION   UPDATINGCONFIG   MESSAGE
      clusters    wxj-18-nodeskew-421-us-east-2a   wxj-18-nodeskew-421   2               2               False         False        4.18.0-0.nightly-2025-11-17-063304   False             False
      

      Many clusteroperators in guest cluster are degraded

      oc get co
      NAME                                       VERSION                              AVAILABLE   PROGRESSING   DEGRADED   SINCE   MESSAGE
      console                                                                                                                      
      csi-snapshot-controller                    4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h36m   
      dns                                        4.21.0-0.nightly-2025-11-13-042845   False       True          True       4h36m   DNS "default" is unavailable.
      image-registry                                                                  False       True          True       4h37m   Available: The deployment does not have available replicas...
      ingress                                                                         False       True          True       4h36m   The "default" ingress controller reports Available=False: IngressControllerUnavailable: One or more status conditions indicate unavailable: DeploymentAvailable=False (DeploymentUnavailable: The deployment has Available status condition set to False (reason: MinimumReplicasUnavailable) with message: Deployment does not have minimum availability.)
      insights                                   4.21.0-0.nightly-2025-11-13-042845   False       False         True       165m    Unable to report: unable to build request to connect to Insights server: Post "https://console.redhat.com/api/ingress/v1/upload": dial tcp: lookup console.redhat.com on 172.31.0.10:53: read udp 10.133.0.10:41898->172.31.0.10:53: read: connection refused
      kube-apiserver                             4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      kube-controller-manager                    4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      kube-scheduler                             4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      kube-storage-version-migrator              4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h31m   
      monitoring                                                                                                                   
      network                                    4.21.0-0.nightly-2025-11-13-042845   True        True          False      4h36m   DaemonSet "/openshift-multus/network-metrics-daemon" is waiting for other operators to become ready...
      node-tuning                                4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h33m   
      openshift-apiserver                        4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      openshift-controller-manager               4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      openshift-samples                                                                                                            
      operator-lifecycle-manager                 4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h36m   
      operator-lifecycle-manager-catalog         4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      operator-lifecycle-manager-packageserver   4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h37m   
      service-ca                                                                                                                   
      storage                                    4.21.0-0.nightly-2025-11-13-042845   True        False         False      4h33m 
      

      Service ca pod has error "the feature gate "UserNamespacesSupport" is disabled: can't set spec.HostUsers"

      oc get pods -n openshift-service-ca-operator
      NAME                                   READY   STATUS              RESTARTS   AGE
      service-ca-operator-85cdc9949d-cmmrw   0/1     ContainerCreating   0          4h34m
      
      oc describe pods -n openshift-service-ca-operator 
      Events:
        Type     Reason                  Age                       From     Message
        ----     ------                  ----                      ----     -------
        Warning  FailedCreatePodSandBox  2m56s (x1246 over 4h33m)  kubelet  Failed to create pod sandbox: the feature gate "UserNamespacesSupport" is disabled: can't set spec.HostUsers
       

      Image registry failed due the tls secret not created

      oc get pods -n openshift-image-registry
      NAME                              READY   STATUS              RESTARTS   AGE
      image-registry-5d5c766c68-mksmp   0/1     ContainerCreating   0          4h34m
      image-registry-7d657df547-ccx5n   0/1     ContainerCreating   0          4h35m
      node-ca-cd8f9                     1/1     Running             0          4h35m
      node-ca-w85x9                     1/1     Running             0          4h35m
      oc describe pods image-registry-5d5c766c68-mksmp -n openshift-image-registry
      Events:
        Type     Reason       Age                      From     Message
        ----     ------       ----                     ----     -------
        Warning  FailedMount  3m33s (x141 over 4h34m)  kubelet  MountVolume.SetUp failed for volume "registry-tls" : secret "image-registry-tls" not found
          

      Expected results:

      The hc should be ready when np has skew with hcp.
          

      Additional info:

      Always meet this issue when hcp is 4.21 while np is 4.18/4.19/4.17(np skew Y-4 is not support offically)
      Can't meet this issue when hcp is 4.20 while np is 4.18/4.19
      Can't meet this issue when hcp is 4.21 while np is 4.16(This test scope is not support offically, just show example here)
          

      Dump log https://drive.google.com/file/d/1Vl6JjNTlmHnGVA-JlwPSVhtOFm7c2eQ_/view?usp=drive_link

              jparrill@redhat.com Juan Manuel Parrilla Madrid
              rh-ee-xiuwang XiuJuan Wang
              None
              None
              XiuJuan Wang XiuJuan Wang
              None
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated: