Uploaded image for project: 'Openshift sandboxed containers'
  1. Openshift sandboxed containers
  2. KATA-1926

Kataconfig deletion get stuck sporadically

XMLWordPrintable

    • False
    • None
    • False
    • KATA-2418 - sandboxed containers: rework kataconfig status reporting
    • Kata Sprint #231, Kata Sprint #233, Kata Sprint #235, Kata Sprint #236
    • 0
    • 0

      Description

      The issue is seen sporadically on clusters with 3 workers and kataconfig with node selector matching with single labeled node. This node appear eventually in the completedNodesList, but unInstallationStatus/inProgress stays True forever with
      finalizers in metadata:
      finalizers:
          - kataconfiguration.openshift.io/finalizer

      Steps to reproduce

      1. Deploy cluster with 3 workers
      2. Label one of them and apply kataconfig with NodeSelector:
      kataConfigPoolSelector:
            matchLabels:
              custom-kata1: test
      3. Delete kataconfig

      Expected result

      kataconfig eventually disposed after runtime been removed from the node

      Actual result

      node is been rebooted and uninstalled, but kataconfig isn't disposed finally, creating an issue, can't apply another kataconfig, can't delete it permanently, only wipe out whole cluster

      Impact

      This can cause problems to test automation and in real deployment

      Env

      OCP 4.12.0-rc.6-x86_64

      Additional helpful info

      kataconfig will be attached

        1. controller.log
          779 kB
        2. del-kataconfig-stuck.txt
          2 kB
        3. kataconfig-wedge.yaml
          1 kB
        4. kataconfig-wedge2.yaml
          1 kB
        5. kataconfig-wedge3.yaml
          1 kB
        6. machine-config-controller-74b4b5cbcf-dj5pw.log
          165 kB
        7. machine-config-daemon-ctblz.log
          14 kB
        8. machine-config-daemon-f95vw.log
          5 kB
        9. machine-config-daemon-ggzp8.log
          14 kB
        10. machine-config-daemon-nmb7v.log
          16 kB
        11. machine-config-daemon-rl42k.log
          16 kB
        12. machine-config-daemon-tp6zj.log
          16 kB
        13. machine-config-operator-649f7f8847-2m275.log
          979 kB
        14. machine-config-server-bgbll.log
          0.5 kB
        15. machine-config-server-c9mhm.log
          0.3 kB
        16. machine-config-server-sqnnx.log
          0.9 kB
        17. osc-controller-manager-8cbff6b9f-rz7tq
          1.02 MB
        18. tpb-controller.log
          376 kB
        19. tpb-controller-deploy.log
          377 kB
        20. tpb-controller-replicaset.log
          377 kB
        21. wedged-controller.log
          378 kB
        22. wedged-controller3.log
          73 kB

              pmores Pavel Mores (Inactive)
              rhn-support-vvoronko Victor Voronkov
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated:
                Resolved: