Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-48714

Day2 monitoring is not handling api server temporarily disconnection

XMLWordPrintable

    • Moderate
    • None
    • Installer (PB) Sprint 265, Installer Sprint 266
    • 2
    • False
    • Hide

      None

      Show
      None
    • Hide
      Cause: A temporary api server disconnect causes the "oc adm node-image monitor" command to fail.

      Consequence: The monitor command fails showing error/EOF.

      Fix: When a temporary api server disconnect happens, it is ignored and the monitor command retries to connect to the api server.

      Result: The monitor command does not fail when there is a temporary api server disconnect.
      Show
      Cause: A temporary api server disconnect causes the "oc adm node-image monitor" command to fail. Consequence: The monitor command fails showing error/EOF. Fix: When a temporary api server disconnect happens, it is ignored and the monitor command retries to connect to the api server. Result: The monitor command does not fail when there is a temporary api server disconnect.
    • Bug Fix
    • Proposed

      This is a clone of issue OCPBUGS-38975. The following is the description of the original issue:

      Description of problem:

        Day2 monitoring is not handling api server temporarily disconnection  

      Version-Release number of selected component (if applicable):

          4.17.0-0.ci-2024-08-26-170911

      How reproducible:

          always in libvirt manually run

      Steps to Reproduce:

          1. Run agent install in libvirt env manually
          2. Run day2 install after cluster is installed succeed
          3. Run 'oc adm node-image monitor' to track the day2 install, when there is api server temporarily disconnection , monitoring program will run into error/EOF.
          4, Only reproduced in libvirt env, baremetal platform is working fine.     

      Actual results:

          Day2 monitoring should run without break to track day2 install in libvirt

      Expected results:

          Day2 monitoring run into error/EOF

      Additional info:

          Monitoring output link: https://docs.google.com/spreadsheets/d/17cOCfYvqxLHlhzBHkwCnFZDUatDRcG1Ej-HQDTDin0c/edit?gid=0#gid=0

              rwsu1@redhat.com Richard Su
              openshift-crt-jira-prow OpenShift Prow Bot
              Manoj Hans Manoj Hans
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: