Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-36653

WMCO CrashLoopBackOff after upgrading from 10.15 to 10.16

XMLWordPrintable

    • No
    • 0
    • WINC - Sprint 256
    • 1
    • False
    • Hide

      None

      Show
      None
    • Release Note Not Required
    • In Progress

      This is a clone of issue OCPBUGS-36395. The following is the description of the original issue:

      Description of problem:

      pod/windows-machine-config-operator-6fcb8974f-ps8rz   0/1     CrashLoopBackOff   6 (112s ago)   13mAzure node upgrade:
      Following upgrade process described in https://issues.redhat.com/browse/OCPBUGS-22984
      Creating new catalog source and uninstalling the operatordelete subscription in WMCO namespacedelete csv in WMCO namespacedelete operatorgroup in WMCO namespace
      Even changing the catalogsource
      ``` oc edit catalogsources.operators.coreos.com wmco -n openshift-marketplace ```
      With new index image similar results
      

      Version-Release number of selected component (if applicable):

          10.16.0-60c5aec

      How reproducible:

          100%

      Steps to Reproduce:

          1. Install 4.15 Azure 2019 cluster WCMO 10.16 with 2 Windows machineset nodes, 2 BYOH nodes
          2. upgrade OCP version to 4.16
          3. change t0 10.16 index image
          

      Actual results:

          WMCO is restarting and crashing with
      Events:
        Type     Reason     Age                   From               Message
        ----     ------     ----                  ----               -------
        Normal   Scheduled  29m                   default-scheduler  Successfully assigned openshift-windows-machine-config-operator/windows-machine-config-operator-6fcb8974f-ps8rz to rrasouli-2096-nzljx-master-0
        Normal   Pulling    29m                   kubelet            Pulling image "registry.redhat.io/openshift4-wincw/windows-machine-config-rhel9-operator@sha256:e650bacf6684ad263dcdc4798673d27a5f690279e5faeda28b7b9054f263da75"
        Normal   Pulled     29m                   kubelet            Successfully pulled image "registry.redhat.io/openshift4-wincw/windows-machine-config-rhel9-operator@sha256:e650bacf6684ad263dcdc4798673d27a5f690279e5faeda28b7b9054f263da75" in 16.26s (16.26s including waiting)
        Normal   Created    24m (x5 over 29m)     kubelet            Created container manager
        Normal   Started    24m (x5 over 29m)     kubelet            Started container manager
        Normal   Pulled     24m (x4 over 28m)     kubelet            Container image "registry.redhat.io/openshift4-wincw/windows-machine-config-rhel9-operator@sha256:e650bacf6684ad263dcdc4798673d27a5f690279e5faeda28b7b9054f263da75" already present on machine
        Warning  BackOff    4m34s (x84 over 27m)  kubelet            Back-off restarting failed container manager in pod windows-machine-config-operator-6fcb8974f-ps8rz_openshift-windows-machine-config-operator(f88ea84a-0be8-44c8-bd0e-42ea03253ac4)
      

      Expected results:

          no restarts stable WMCO

      Additional info:

          oc logs deployment.apps/windows-machine-config-operator -n openshift-windows-machine-config-operator -f
      {"level":"info","ts":"2024-07-01T15:34:24Z","logger":"version","msg":"operator","version":"10.16.0-60c5aec"}
      {"level":"info","ts":"2024-07-01T15:34:24Z","logger":"version","msg":"go","version":"go1.21.9 (Red Hat 1.21.9-1.el9_4) linux/amd64"}
      {"level":"info","ts":"2024-07-01T15:34:24Z","logger":"leader","msg":"Trying to become the leader."}
      {"level":"info","ts":"2024-07-01T15:34:25Z","logger":"leader","msg":"Found existing lock with my name. I was likely restarted."}
      {"level":"info","ts":"2024-07-01T15:34:25Z","logger":"leader","msg":"Continuing as the leader."}
      {"level":"info","ts":"2024-07-01T15:34:25Z","logger":"setup","msg":"operator","namespace":"openshift-windows-machine-config-operator"}
      {"level":"error","ts":"2024-07-01T15:34:25Z","logger":"controller.secret","msg":"Unable to retrieve private key, please ensure it is created","error":"the cache is not started, can not read objects","stacktrace":"github.com/openshift/windows-machine-config-operator/controllers.(*SecretReconciler).SetupWithManager\n\t/remote-source/build/windows-machine-config-operator/controllers/secret_controller.go:64\nmain.main\n\t/remote-source/build/windows-machine-config-operator/cmd/operator/main.go:219\nruntime.main\n\t/usr/lib/golang/src/runtime/proc.go:267"}
      {"level":"info","ts":"2024-07-01T15:34:25Z","logger":"setup","msg":"starting manager"}
      {"level":"info","ts":"2024-07-01T15:34:25Z","logger":"controller-runtime.metrics","msg":"Starting metrics server"}
      {"level":"info","ts":"2024-07-01T15:34:25Z","logger":"controller-runtime.metrics","msg":"Serving metrics server","bindAddress":"0.0.0.0:9182","secure":false}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"machine","controllerGroup":"machine.openshift.io","controllerKind":"Machine","source":"kind source: *v1beta1.Machine"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"machine","controllerGroup":"machine.openshift.io","controllerKind":"Machine","source":"kind source: *v1.Node"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"machine","controllerGroup":"machine.openshift.io","controllerKind":"Machine"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"secret","controllerGroup":"","controllerKind":"Secret","source":"kind source: *v1.Secret"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"imagedigestmirrorset","controllerGroup":"config.openshift.io","controllerKind":"ImageDigestMirrorSet","source":"kind source: *v1.ImageDigestMirrorSet"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"secret","controllerGroup":"","controllerKind":"Secret","source":"kind source: *v1.Secret"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"imagedigestmirrorset","controllerGroup":"config.openshift.io","controllerKind":"ImageDigestMirrorSet","source":"kind source: *v1.ImageTagMirrorSet"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"imagedigestmirrorset","controllerGroup":"config.openshift.io","controllerKind":"ImageDigestMirrorSet","source":"kind source: *v1.Secret"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"secret","controllerGroup":"","controllerKind":"Secret"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"controllerconfig","controllerGroup":"machineconfiguration.openshift.io","controllerKind":"ControllerConfig","source":"kind source: *v1.ControllerConfig"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"node","controllerGroup":"","controllerKind":"Node","source":"kind source: *v1.Node"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"node","controllerGroup":"","controllerKind":"Node"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"imagedigestmirrorset","controllerGroup":"config.openshift.io","controllerKind":"ImageDigestMirrorSet"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"controllerconfig","controllerGroup":"machineconfiguration.openshift.io","controllerKind":"ControllerConfig"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"configmap","controllerGroup":"","controllerKind":"ConfigMap","source":"kind source: *v1.ConfigMap"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"configmap","controllerGroup":"","controllerKind":"ConfigMap","source":"kind source: *v1.Node"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"configmap","controllerGroup":"","controllerKind":"ConfigMap","source":"kind source: *v1.Node"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"configmap","controllerGroup":"","controllerKind":"ConfigMap"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting EventSource","controller":"certificatesigningrequest","controllerGroup":"certificates.k8s.io","controllerKind":"CertificateSigningRequest","source":"kind source: *v1.CertificateSigningRequest"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting Controller","controller":"certificatesigningrequest","controllerGroup":"certificates.k8s.io","controllerKind":"CertificateSigningRequest"}
      {"level":"info","ts":"2024-07-01T15:34:27Z","msg":"Starting workers","controller":"secret","controllerGroup":"","controllerKind":"Secret","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:28Z","msg":"Starting workers","controller":"imagedigestmirrorset","controllerGroup":"config.openshift.io","controllerKind":"ImageDigestMirrorSet","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:28Z","msg":"Starting workers","controller":"machine","controllerGroup":"machine.openshift.io","controllerKind":"Machine","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:28Z","msg":"Starting workers","controller":"node","controllerGroup":"","controllerKind":"Node","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:28Z","msg":"Starting workers","controller":"controllerconfig","controllerGroup":"machineconfiguration.openshift.io","controllerKind":"ControllerConfig","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:28Z","msg":"Starting workers","controller":"certificatesigningrequest","controllerGroup":"certificates.k8s.io","controllerKind":"CertificateSigningRequest","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:28Z","msg":"Starting workers","controller":"configmap","controllerGroup":"","controllerKind":"ConfigMap","worker count":1}
      {"level":"info","ts":"2024-07-01T15:34:47Z","logger":"controllers.controllerconfig","msg":"updating kubelet CA client certificates in","node":"rrasouli-2096-1"}
      {"level":"info","ts":"2024-07-01T15:34:48Z","logger":"controllers.registry","msg":"updating containerd config","registry":{"name":""},"directory":"C:\\k\\containerd\\registries","node":"rrasouli-2096-1"}
      {"level":"info","ts":"2024-07-01T15:34:57Z","logger":"controllers.configmap","msg":"processing","instances in":"windows-instances"}
      {"level":"info","ts":"2024-07-01T15:34:57Z","logger":"controller.windowsmachine","msg":"processing","windowsmachine":{"name":"windows-dfpqd","namespace":"openshift-machine-api"},"address":"10.0.128.7"}

              rh-ee-mankulka Mansi Kulkarni
              openshift-crt-jira-prow OpenShift Prow Bot
              Weinan Liu Weinan Liu
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated:
                Resolved: