Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-22309

periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-sdn-bm failing since Oct 19

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • 1
    • None
    • No
    • None
    • None
    • None
    • Metal Platform 244
    • 1
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Sippy shows that baremetal blocking job on 4.13 started failing on Oct 19th and hasn't recovered. The two prevailing causes of failure are lease aquisition and metalkube returning an Internal Server Error.

      https://sippy.dptools.openshift.org/sippy-ng/jobs/4.13/analysis?filters=%7B%22items%22%3A%5B%7B%22columnField%22%3A%22name%22%2C%22operatorValue%22%3A%22equals%22%2C%22value%22%3A%22periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-sdn-bm%22%7D%5D%7D

       

      Lease acquisition is known and straight forward, here's an example of the ISE.

      https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-sdn-bm/1716554939418808320

       

      ```
      info: Mirroring completed in 0s (0B/s)
      645
      E1023 21:33:03.494942 66 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      646
      E1023 21:33:03.558487 66 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      647
      E1023 21:33:03.621854 66 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      648
      E1023 21:33:03.685509 66 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      649
      E1023 21:33:03.749401 66 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      650
      Unable to connect to the server: Internal Server Error
      651
      failed to ack the 4.9 Kube v1beta1 removals; possibly API-server issue, or a pre-4.8 release image
      652
      E1023 21:33:03.864235 84 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      653
      E1023 21:33:03.927752 84 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      654
      E1023 21:33:03.991368 84 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      655
      E1023 21:33:04.054933 84 memcache.go:238] couldn't get current server API group list: Get "https://api.ostest.test.metalkube.org:6443/api?timeout=32s": Internal Server Error
      656
      Unable to connect to the server: Internal Server Error
      657

      {"component":"entrypoint","error":"wrapped process failed: exit status 1","file":"k8s.io/test-infra/prow/entrypoint/run.go:84","func":"k8s.io/test-infra/prow/entrypoint.Options.internalRun","level":"error","msg":"Error executing test process","severity":"error","time":"2023-10-23T21:33:04Z"}

      658
      error: failed to execute wrapped command: exit status 1
      659
      INFO[2023-10-23T21:33:04Z] Step e2e-metal-ipi-sdn-bm-baremetalds-e2e-test failed after 1m53s.
      660
      INFO[2023-10-23T21:33:04Z] Step phase test failed after 1m53s.
      ```

              dhiggins@redhat.com Derek Higgins
              rhn-support-sdodson Scott Dodson
              None
              None
              Jad Haj Yahya Jad Haj Yahya
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: