Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-31874

Cluster operator image-registry is Degraded when using hive deploying OpenStack cluster.

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Not a Bug
    • Icon: Major Major
    • None
    • 4.14
    • Image Registry
    • No
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      Cluster operator image-registry is Degraded when using hive deploying OpenStack cluster. 
      And the image-registry pod are CrashLoopBackOff.
      [hmx@fedora hive]$ oc get pod -n openshift-image-registry NAME                                               READY   STATUS             RESTARTS       AGE cluster-image-registry-operator-768f87dc77-dk84d   1/1     Running            1 (29m ago)    44m image-registry-5bb44d4b85-mt4dp                    0/1     CrashLoopBackOff   10 (55s ago)   29m image-registry-5bb44d4b85-v5s2q                    0/1     CrashLoopBackOff   10 (57s ago)   29m node-ca-47c5t                                      1/1     Running            0              29m node-ca-cxlzd                                      1/1     Running            0              29m node-ca-d7pn9                                      1/1     Running            0              27m node-ca-l54ct                                      1/1     Running            0              27m node-ca-rxmv9                                      1/1     Running            0              27m node-ca-zqgdb                                      1/1     Running            0              29m

      Version-Release number of selected component (if applicable):

      registry.ci.openshift.org/ocp/release:4.14.0-0.nightly-2024-04-04-065345

      How reproducible:

          Always

      Steps to Reproduce:

          1.Using hive deploying OpenStack cluster.
          2.master/woeker nodes are running, but image-registry operator is Degraded.
          3.
          

      Actual results:

          image-registry operator is Degraded.

      Expected results:

          image-registry operator is ready.

      Additional info:

          [hmx@fedora hive]$ oc get co
      NAME                                       VERSION                              AVAILABLE   PROGRESSING   DEGRADED   SINCE   MESSAGE
      authentication                             4.14.0-0.nightly-2024-04-04-065345   True        False         False      15m     
      baremetal                                  4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      cloud-controller-manager                   4.14.0-0.nightly-2024-04-04-065345   True        False         False      41m     
      cloud-credential                           4.14.0-0.nightly-2024-04-04-065345   True        False         False      43m     
      cluster-autoscaler                         4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      config-operator                            4.14.0-0.nightly-2024-04-04-065345   True        False         False      39m     
      console                                    4.14.0-0.nightly-2024-04-04-065345   True        False         False      20m     
      control-plane-machine-set                  4.14.0-0.nightly-2024-04-04-065345   True        False         False      29m     
      csi-snapshot-controller                    4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      dns                                        4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      etcd                                       4.14.0-0.nightly-2024-04-04-065345   True        False         False      31m     
      image-registry                                                                  False       True          True       26m     Available: The deployment does not have available replicas...
      ingress                                    4.14.0-0.nightly-2024-04-04-065345   True        False         False      24m     
      insights                                   4.14.0-0.nightly-2024-04-04-065345   True        False         False      32m     
      kube-apiserver                             4.14.0-0.nightly-2024-04-04-065345   True        False         False      26m     
      kube-controller-manager                    4.14.0-0.nightly-2024-04-04-065345   True        False         False      35m     
      kube-scheduler                             4.14.0-0.nightly-2024-04-04-065345   True        False         False      35m     
      kube-storage-version-migrator              4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      machine-api                                4.14.0-0.nightly-2024-04-04-065345   True        False         False      23m     
      machine-approver                           4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      machine-config                             4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      marketplace                                4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      monitoring                                 4.14.0-0.nightly-2024-04-04-065345   True        False         False      22m     
      network                                    4.14.0-0.nightly-2024-04-04-065345   True        False         False      40m     
      node-tuning                                4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      openshift-apiserver                        4.14.0-0.nightly-2024-04-04-065345   True        False         False      26m     
      openshift-controller-manager               4.14.0-0.nightly-2024-04-04-065345   True        False         False      34m     
      openshift-samples                          4.14.0-0.nightly-2024-04-04-065345   True        False         False      25m     
      operator-lifecycle-manager                 4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      operator-lifecycle-manager-catalog         4.14.0-0.nightly-2024-04-04-065345   True        False         False      38m     
      operator-lifecycle-manager-packageserver   4.14.0-0.nightly-2024-04-04-065345   True        False         False      26m     
      service-ca                                 4.14.0-0.nightly-2024-04-04-065345   True        False         False      39m     
      storage                                    4.14.0-0.nightly-2024-04-04-065345   True        False         False      37m     
      [hmx@fedora hive]$ oc get machineset -A
      NAMESPACE               NAME                         DESIRED   CURRENT   READY   AVAILABLE   AGE
      openshift-machine-api   mihuangoshv-ww9wt-worker-0   3         3         3       3           45m
      [hmx@fedora hive]$ 
      [hmx@fedora hive]$ oc get machines -A
      NAMESPACE               NAME                               PHASE     TYPE             REGION      ZONE   AGE
      openshift-machine-api   mihuangoshv-ww9wt-master-0         Running   m1.xlarge.disk   regionOne   nova   45m
      openshift-machine-api   mihuangoshv-ww9wt-master-1         Running   m1.xlarge.disk   regionOne   nova   45m
      openshift-machine-api   mihuangoshv-ww9wt-master-2         Running   m1.xlarge.disk   regionOne   nova   45m
      openshift-machine-api   mihuangoshv-ww9wt-worker-0-bzd9t   Running   m1.xlarge.disk   regionOne   nova   38m
      openshift-machine-api   mihuangoshv-ww9wt-worker-0-g4qqv   Running   m1.xlarge.disk   regionOne   nova   38m
      openshift-machine-api   mihuangoshv-ww9wt-worker-0-l9rn9   Running   m1.xlarge.disk   regionOne   nova   38m
      
      [hmx@fedora hive]$ oc get pod -n openshift-image-registry
      NAME                                               READY   STATUS             RESTARTS       AGE
      cluster-image-registry-operator-768f87dc77-dk84d   1/1     Running            1 (29m ago)    44m
      image-registry-5bb44d4b85-mt4dp                    0/1     CrashLoopBackOff   10 (55s ago)   29m
      image-registry-5bb44d4b85-v5s2q                    0/1     CrashLoopBackOff   10 (57s ago)   29m
      node-ca-47c5t                                      1/1     Running            0              29m
      node-ca-cxlzd                                      1/1     Running            0              29m
      node-ca-d7pn9                                      1/1     Running            0              27m
      node-ca-l54ct                                      1/1     Running            0              27m
      node-ca-rxmv9                                      1/1     Running            0              27m
      node-ca-zqgdb                                      1/1     Running            0              29m
      
      [hmx@fedora hive]$ oc describe pod image-registry-5bb44d4b85-mt4dp -o yaml
      error: unknown shorthand flag: 'o' in -o
      See 'oc describe --help' for usage.
      [hmx@fedora hive]$ oc describe pod image-registry-5bb44d4b85-mt4dp
      Error from server (NotFound): pods "image-registry-5bb44d4b85-mt4dp" not found
      [hmx@fedora hive]$ oc describe pod image-registry-5bb44d4b85-mt4dp -n openshift-image-registry
      Name:                 image-registry-5bb44d4b85-mt4dp
      Namespace:            openshift-image-registry
      Priority:             2000000000
      Priority Class Name:  system-cluster-critical
      Service Account:      registry
      Node:                 mihuangoshv-ww9wt-worker-0-l9rn9/192.168.2.192
      Start Time:           Mon, 08 Apr 2024 20:05:11 +0800
      Labels:               docker-registry=default
                            pod-template-hash=5bb44d4b85
      Annotations:          imageregistry.operator.openshift.io/dependencies-checksum: sha256:0b0dea928daf2900e7ef2411c4b4a4191262b8545b0d29d4a24b7e8744b65b7c
                            k8s.ovn.org/pod-networks:
                              {"default":{"ip_addresses":["10.131.0.8/23"],"mac_address":"0a:58:0a:83:00:08","gateway_ips":["10.131.0.1"],"routes":[{"dest":"10.128.0.0/...
                            k8s.v1.cni.cncf.io/network-status:
                              [{
                                  "name": "ovn-kubernetes",
                                  "interface": "eth0",
                                  "ips": [
                                      "10.131.0.8"
                                  ],
                                  "mac": "0a:58:0a:83:00:08",
                                  "default": true,
                                  "dns": {}
                              }]
                            openshift.io/scc: restricted-v2
                            seccomp.security.alpha.kubernetes.io/pod: runtime/default
      Status:               Running
      SeccompProfile:       RuntimeDefault
      IP:                   10.131.0.8
      IPs:
        IP:           10.131.0.8
      Controlled By:  ReplicaSet/image-registry-5bb44d4b85
      Containers:
        registry:
          Container ID:  cri-o://34e8b0fe34e753aa2aa034fc4b9358019a6222b2a4d7dd38060855d83462e342
          Image:         quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:fc653a9c24392d7e51552bdfef8e3eb681eea1b04a1cc8cbd11069b4e19043d5
          Image ID:      quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:fc653a9c24392d7e51552bdfef8e3eb681eea1b04a1cc8cbd11069b4e19043d5
          Port:          5000/TCP
          Host Port:     0/TCP
          Command:
            /bin/sh
            -c
            mkdir -p /etc/pki/ca-trust/extracted/edk2 /etc/pki/ca-trust/extracted/java /etc/pki/ca-trust/extracted/openssl /etc/pki/ca-trust/extracted/pem && update-ca-trust extract && exec /usr/bin/dockerregistry
          State:          Waiting
            Reason:       CrashLoopBackOff
          Last State:     Terminated
            Reason:       Error
            Exit Code:    2
            Started:      Mon, 08 Apr 2024 20:31:27 +0800
            Finished:     Mon, 08 Apr 2024 20:31:27 +0800
          Ready:          False
          Restart Count:  10
          Requests:
            cpu:      100m
            memory:   256Mi
          Liveness:   http-get https://:5000/healthz delay=5s timeout=5s period=10s #success=1 #failure=3
          Readiness:  http-get https://:5000/healthz delay=15s timeout=5s period=10s #success=1 #failure=3
          Environment:
            REGISTRY_STORAGE:                                    swift
            REGISTRY_STORAGE_SWIFT_CONTAINER:                    mihuangoshv-ww9wt-image-registry-giyxpcgenqsafvnmyeljrrwdwkglp
            REGISTRY_STORAGE_SWIFT_AUTHURL:                      https://rhos-d.infra.prod.upshift.rdu2.redhat.com:13000/v3/auth/tokens
            REGISTRY_STORAGE_SWIFT_USERNAME:                     <set to the key 'REGISTRY_STORAGE_SWIFT_USERNAME' in secret 'image-registry-private-configuration'>                     Optional: false
            REGISTRY_STORAGE_SWIFT_PASSWORD:                     <set to the key 'REGISTRY_STORAGE_SWIFT_PASSWORD' in secret 'image-registry-private-configuration'>                     Optional: false
            REGISTRY_STORAGE_SWIFT_APPLICATIONCREDENTIALID:      <set to the key 'REGISTRY_STORAGE_SWIFT_APPLICATIONCREDENTIALID' in secret 'image-registry-private-configuration'>      Optional: false
            REGISTRY_STORAGE_SWIFT_APPLICATIONCREDENTIALNAME:    <set to the key 'REGISTRY_STORAGE_SWIFT_APPLICATIONCREDENTIALNAME' in secret 'image-registry-private-configuration'>    Optional: false
            REGISTRY_STORAGE_SWIFT_APPLICATIONCREDENTIALSECRET:  <set to the key 'REGISTRY_STORAGE_SWIFT_APPLICATIONCREDENTIALSECRET' in secret 'image-registry-private-configuration'>  Optional: false
            REGISTRY_STORAGE_SWIFT_AUTHVERSION:                  3
            REGISTRY_STORAGE_SWIFT_DOMAIN:                       redhat.com
            REGISTRY_STORAGE_SWIFT_TENANT:                       openshift-qe-jenkins
            REGISTRY_STORAGE_SWIFT_TENANTID:                     542c6ebd48bf40fa857fc245c7572e30
            REGISTRY_STORAGE_SWIFT_REGION:                       regionOne
            REGISTRY_HTTP_ADDR:                                  :5000
            REGISTRY_HTTP_NET:                                   tcp
            REGISTRY_HTTP_SECRET:                                24ee7e46142fdb502750944fb8c59e9850b9e3faf3c3f8bfd558a2c80ed5b017b595147cf3856fc38db2f3d474f74797a340a1a13876446c027362c691f97307
            REGISTRY_LOG_LEVEL:                                  info
            REGISTRY_OPENSHIFT_QUOTA_ENABLED:                    true
            REGISTRY_STORAGE_CACHE_BLOBDESCRIPTOR:               inmemory
            REGISTRY_STORAGE_DELETE_ENABLED:                     true
            REGISTRY_HEALTH_STORAGEDRIVER_ENABLED:               true
            REGISTRY_HEALTH_STORAGEDRIVER_INTERVAL:              10s
            REGISTRY_HEALTH_STORAGEDRIVER_THRESHOLD:             1
            REGISTRY_OPENSHIFT_METRICS_ENABLED:                  true
            REGISTRY_OPENSHIFT_SERVER_ADDR:                      image-registry.openshift-image-registry.svc:5000
            REGISTRY_HTTP_TLS_CERTIFICATE:                       /etc/secrets/tls.crt
            REGISTRY_HTTP_TLS_KEY:                               /etc/secrets/tls.key
          Mounts:
            /etc/pki/ca-trust/extracted from ca-trust-extracted (rw)
            /etc/pki/ca-trust/source/anchors from registry-certificates (rw)
            /etc/secrets from registry-tls (rw)
            /usr/share/pki/ca-trust-source from trusted-ca (rw)
            /var/lib/kubelet/ from installation-pull-secrets (rw)
            /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-pctgm (ro)
            /var/run/secrets/openshift/serviceaccount from bound-sa-token (ro)
      Conditions:
        Type              Status
        Initialized       True 
        Ready             False 
        ContainersReady   False 
        PodScheduled      True 
      Volumes:
        registry-tls:
          Type:                Projected (a volume that contains injected data from multiple sources)
          SecretName:          image-registry-tls
          SecretOptionalName:  <nil>
        ca-trust-extracted:
          Type:       EmptyDir (a temporary directory that shares a pod's lifetime)
          Medium:     
          SizeLimit:  <unset>
        registry-certificates:
          Type:      ConfigMap (a volume populated by a ConfigMap)
          Name:      image-registry-certificates
          Optional:  false
        trusted-ca:
          Type:      ConfigMap (a volume populated by a ConfigMap)
          Name:      trusted-ca
          Optional:  true
        installation-pull-secrets:
          Type:        Secret (a volume populated by a Secret)
          SecretName:  installation-pull-secrets
          Optional:    true
        bound-sa-token:
          Type:                    Projected (a volume that contains injected data from multiple sources)
          TokenExpirationSeconds:  3600
        kube-api-access-pctgm:
          Type:                     Projected (a volume that contains injected data from multiple sources)
          TokenExpirationSeconds:   3607
          ConfigMapName:            kube-root-ca.crt
          ConfigMapOptional:        <nil>
          DownwardAPI:              true
          ConfigMapName:            openshift-service-ca.crt
          ConfigMapOptional:        <nil>
      QoS Class:                    Burstable
      Node-Selectors:               kubernetes.io/os=linux
      Tolerations:                  node.kubernetes.io/memory-pressure:NoSchedule op=Exists
                                    node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                                    node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
      Topology Spread Constraints:  kubernetes.io/hostname:DoNotSchedule when max skew 1 is exceeded for selector docker-registry=default
                                    node-role.kubernetes.io/worker:DoNotSchedule when max skew 1 is exceeded for selector docker-registry=default
                                    topology.kubernetes.io/zone:DoNotSchedule when max skew 1 is exceeded for selector docker-registry=default
      Events:
        Type     Reason            Age                    From               Message
        ----     ------            ----                   ----               -------
        Warning  FailedScheduling  29m                    default-scheduler  0/3 nodes are available: 3 node(s) had untolerated taint {node-role.kubernetes.io/master: }. preemption: 0/3 nodes are available: 3 Preemption is not helpful for scheduling..
        Warning  FailedScheduling  27m (x2 over 29m)      default-scheduler  0/3 nodes are available: 3 node(s) had untolerated taint {node-role.kubernetes.io/master: }. preemption: 0/3 nodes are available: 3 Preemption is not helpful for scheduling..
        Normal   Scheduled         27m                    default-scheduler  Successfully assigned openshift-image-registry/image-registry-5bb44d4b85-mt4dp to mihuangoshv-ww9wt-worker-0-l9rn9
        Normal   AddedInterface    27m                    multus             Add eth0 [10.131.0.8/23] from ovn-kubernetes
        Normal   Started           27m (x4 over 27m)      kubelet            Started container registry
        Normal   Pulled            26m (x5 over 27m)      kubelet            Container image "quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:fc653a9c24392d7e51552bdfef8e3eb681eea1b04a1cc8cbd11069b4e19043d5" already present on machine
        Normal   Created           26m (x5 over 27m)      kubelet            Created container registry
        Warning  BackOff           2m47s (x129 over 27m)  kubelet            Back-off restarting failed container registry in pod image-registry-5bb44d4b85-mt4dp_openshift-image-registry(ad37eedf-7dc7-4d8c-9527-c4db613ddac3)
       

            maandre@redhat.com Martin André
            mihuang@redhat.com Mingxia Huang
            xiujuan wang xiujuan wang
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: