OCPBUGS-18905

Cluster failed to precache and did not retry because of "Failed to connect to kubernetes.default.svc port 443: Connection refused"

    • No
    • CNF RAN Sprint 242, CNF RAN Sprint 243, CNF RAN Sprint 244
    • 3
    • False
    • 9/19: telco priority pending

      Description of problem:

      While upgrading 3557 clusters from 4.12.27 to 4.12.29, with all clusters precached before the upgrade, one cluster failed early during precaching with the following in its logs:
      
      # oc --kubeconfig /root/hv-vm/kc/vm00311/kubeconfig logs -n openshift-talo-pre-cache pre-cache-47nlx
        % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                       Dload  Upload   Total   Spent    Left  Speed
        0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
      curl: (7) Failed to connect to kubernetes.default.svc port 443: Connection refused
      highThresholdPercent:  diskSize:125293548 used:30470528
      ERROR: not enough space for precaching
      
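      It appears that the precaching script failed to complete a curl request because of an unexpected, intermittent API outage. The log shows highThresholdPercent empty, so the "not enough space" error looks like a consequence of the failed request rather than a genuinely full disk (used 30470528 of 125293548).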
      
      https://github.com/openshift-kni/cluster-group-upgrades-operator/blob/release-4.13/pre-cache/check_space#L8
      
      Shouldn't the script retry the request, or the job pod be retried?
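
To make the retry question above concrete, here is a minimal sketch of a retry wrapper in shell. It is not taken from check_space: the `retry` function, its arguments, and the `$url` and `$out` variables in the usage comment are hypothetical names chosen for illustration, and the attempt count and delay are arbitrary.

```shell
#!/bin/bash
# Hypothetical helper (not part of the real check_space script):
# run a command up to $1 times, sleeping $2 seconds between attempts.
retry() {
    local attempts=$1 delay=$2
    shift 2
    local n=1
    # Re-run the command until it exits 0 or the attempt budget is spent.
    until "$@"; do
        if [ "$n" -ge "$attempts" ]; then
            echo "ERROR: '$*' failed after $n attempts" >&2
            return 1
        fi
        echo "attempt $n failed, retrying in ${delay}s" >&2
        sleep "$delay"
        n=$((n + 1))
    done
}

# Illustrative usage; $url and $out are placeholders:
#   retry 5 10 curl -sSf "$url" -o "$out" || exit 1
```

With a wrapper like this, a single "Connection refused" would delay the space check instead of failing it, and the script would exit non-zero only after every attempt failed. curl also has built-in `--retry`, `--retry-delay` and `--retry-connrefused` options that cover the same case without a wrapper.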

      Version-Release number of selected component (if applicable):

      ACM - 2.9.0-DOWNSTREAM-2023-08-28-21-42-15
      Hub OCP 4.13.10
      Deployed SNOs - 4.12.27
      TALM - 4.13.0 (Threaded with 5 threads)

      How reproducible:

       

      Steps to Reproduce:

      1.
      2.
      3.
      

      Actual results:

       

      Expected results:

       

      Additional info:

       

            saskari@redhat.com Saeid Askari
            akrzos@redhat.com Alex Krzos
            Dan Radez Dan Radez
