Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-3547

On an SNO DU some of the test workload pods get restarted due to failed probes when leaving the node running for multiple days

    XMLWordPrintable

Details

    • Important
    • CNF RAN Sprint 230, CNF RAN Sprint 231, CNF RAN Sprint 232, CNF RAN Sprint 233, CNF RAN Sprint 234, CNF RAN Sprint 235, CNF RAN Sprint 236, CNF RAN Sprint 237, CNF RAN Sprint 238, CNF RAN Sprint 239, CNF RAN Sprint 240, CNF RAN Sprint 241, CNF RAN Sprint 242, CNF RAN Sprint 243, CNF RAN Sprint 244
    • 15
    • Rejected
    • False
    • Hide

      None

      Show
      None
    • No text required - the fix was in RHEL and it is already listed in RHBA-2023:5450 advisory.
    • Bug Fix
    • Hide
      9/11: green for 4.12.z, RHELPLAN-163770 is ON_QA
      8/28: addressed in 9.2 / 4.13.z/4.14 via RHELPLAN-158160.
      Rel Note for Telco: Not Required (per DE/BW)
      Show
      9/11: green for 4.12.z, RHELPLAN-163770 is ON_QA 8/28: addressed in 9.2 / 4.13.z/4.14 via RHELPLAN-158160. Rel Note for Telco: Not Required (per DE/BW)

    Description

      Description of problem:

      On an SNO DU some of the test workload pods get restarted due to failed probes when leaving the node running for multiple days.

      Version-Release number of selected component (if applicable):

      4.12.0-0.nightly-2022-11-06-054655

      How reproducible:

      Not constantly but the issue reproduced on 2 setups after leaving the application running for more than 2 days

      Steps to Reproduce:

      1. Deploy SNO with DU profile
      2. Create test app from https://gitlab.cee.redhat.com/ocp-edge-qe/vdu-workload-emulator
      3. Leave the setup running for 2 days
      4. Check test app pods status

      Actual results:

      Some of the pods report restarts:
      
      [kni@registry.kni-qe-22 ~]$ oc -n test get pods
      NAME                                   READY   STATUS    RESTARTS        AGE
      test1-deployment-69499db98d-688q9      1/1     Running   0               3d20h
      test10-0                               2/2     Running   0               3d20h
      test10-1                               2/2     Running   0               3d20h
      test10-1-deployment-5b89b4668f-tmbp6   3/3     Running   0               3d20h
      test11-0                               2/2     Running   0               3d20h
      test12-0                               2/2     Running   0               3d20h
      test13-0                               2/2     Running   0               3d20h
      test14-0                               2/2     Running   0               3d20h
      test15-0                               2/2     Running   0               3d20h
      test15-1                               2/2     Running   0               3d20h
      test16-deployment-5b59fc6758-bcdmh     1/1     Running   0               3d20h
      test16-deployment-5b59fc6758-wl68b     1/1     Running   0               3d20h
      test17-0                               2/2     Running   0               3d20h
      test17-1                               2/2     Running   0               3d20h
      test17-2                               2/2     Running   0               3d20h
      test18-deployment-7cd4558687-9zfp2     1/1     Running   0               3d20h
      test19-deployment-f56c5c8bc-jcd6w      2/2     Running   0               3d20h
      test19-deployment-f56c5c8bc-vmqht      2/2     Running   0               3d20h
      test2-deployment-7c8b7b7c65-2cbsq      1/1     Running   0               3d20h
      test21-deployment-6cf7bbc97f-rdr5c     1/1     Running   0               3d20h
      test22-deployment-bf6d49d7c-26hlv      2/2     Running   0               3d20h
      test23-deployment-8d6d46dcd-db9t6      2/2     Running   0               3d20h
      test23-deployment-8d6d46dcd-x7wbn      2/2     Running   0               3d20h
      test24-deployment-859b544f64-jjfq9     1/1     Running   1 (5h44m ago)   3d20h
      test25-deployment-6ffcc8cc6d-mc7wr     1/1     Running   1 (5h44m ago)   3d20h
      test26-deployment-58546d77d4-n4cc7     3/3     Running   1 (5h44m ago)   3d20h
      test27-deployment-8565cb6fff-4s2w8     1/1     Running   0               3d20h
      test28-deployment-555c6ccf69-9gqsr     5/5     Running   2 (5h43m ago)   3d20h
      test29-deployment-7d59454fb7-5thrb     2/2     Running   2 (3h13m ago)   3d20h
      test30-deployment-69f7b5bfcb-tvwnj     4/4     Running   0               3d20h
      test32-deployment-55d7f69dfb-x4ctb     5/5     Running   0               3d20h
      test33-deployment-5ddfb848d6-dc8t8     2/2     Running   1 (5h44m ago)   3d20h
      test33-deployment-5ddfb848d6-gjj88     2/2     Running   1 (5h44m ago)   3d20h
      test33-deployment-5ddfb848d6-xbm99     2/2     Running   1 (5h44m ago)   3d20h
      test34-deployment-6bc769fb95-g8shl     2/2     Running   3 (3h13m ago)   3d20h
      test34-deployment-6bc769fb95-kf2z4     2/2     Running   2 (3h13m ago)   3d20h
      test34-deployment-6bc769fb95-vxqzq     2/2     Running   2 (3h13m ago)   3d20h
      test35-deployment-776bcbb759-g8hhb     2/2     Running   2 (3h13m ago)   3d20h
      test36-deployment-745c767b74-f5bc2     2/2     Running   3 (3h13m ago)   3d20h
      test37-deployment-596b7ddd78-jb6hg     4/4     Running   0               3d20h
      test38-deployment-755d7664b8-x9dms     7/7     Running   0               3d20h
      test39-deployment-6c8d89c4b6-wqtfx     3/3     Running   0               3d20h
      test4-deployment-7f9665cc69-qpx6k      1/1     Running   0               3d20h
      test40-deployment-5bb5f98549-ldt8v     1/1     Running   0               3d20h
      test41-deployment-694fdf6865-7642k     2/2     Running   2 (3h13m ago)   3d20h
      test42-0                               1/1     Running   0               3d20h
      test43-deployment-9b7f9ffcf-h77bh      2/2     Running   2 (3h13m ago)   3d20h
      test44-deployment-649dccb999-pzmhg     2/2     Running   0               3d20h
      test45-0                               2/2     Running   2 (3h13m ago)   3d20h
      test46-0                               2/2     Running   0               3d20h
      test47-deployment-6ccd4cfddb-6kqmh     3/3     Running   0               3d20h
      test48-deployment-d7dcfd86c-fqx69      3/3     Running   0               3d20h
      test48-deployment-d7dcfd86c-s27zw      3/3     Running   0               3d20h
      test49-deployment-6645d7d66d-tx7nf     1/1     Running   2 (3h13m ago)   3d20h
      test5-deployment-6d759959fc-mfl9z      7/7     Running   8 (3h11m ago)   3d20h
      test50-deployment-86c9ddc4c4-mvjkv     1/1     Running   0               3d20h
      test6-0                                1/1     Running   0               3d20h
      test7-0                                2/2     Running   0               3d20h
      test8-0                                2/2     Running   0               3d20h
      test9-0                                2/2     Running   0               3d20h
      

      Expected results:

      Pods do not get restarted

      Additional info:

      Attaching must-gather and sosreport.

      Attachments

        Issue Links

          Activity

            People

              rh-ee-cshulyup Costa Shulyupin
              mcornea@redhat.com Marius Cornea
              Marius Cornea Marius Cornea
              Votes:
              0 Vote for this issue
              Watchers:
              16 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: