Uploaded image for project: 'OpenShift Installer'
  1. OpenShift Installer
  2. CORS-4177

apiserver timeouts during install are not configurable

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • None
    • None
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      With my laptop throttled, I was seeing my installs timeout due to the apiserver being unable to reach the etcd server:

      level=info msg=Started local control plane with envtest
      E0804 16:49:51.518514 660718 server.go:328] "unable to start the controlplane" err="timeout waiting for process kube-apiserver to start" logger="controller-runtime.test-env" tries=0
      E0804 16:50:11.972156 660718 server.go:328] "unable to start the controlplane" err="timeout waiting for process kube-apiserver to start successfully (it may have failed to start, or stopped unexpectedly before becoming ready)" logger="controller-runtime.test-env" tries=1
      E0804 16:50:29.253101 660718 server.go:328] "unable to start the controlplane" err="timeout waiting for process kube-apiserver to start" logger="controller-runtime.test-env" tries=2
      E0804 16:50:49.742748 660718 server.go:328] "unable to start the controlplane" err="timeout waiting for process kube-apiserver to start successfully (it may have failed to start, or stopped unexpectedly before becoming ready)" logger="controller-runtime.test-env" tries=3
      E0804 16:51:06.829984 660718 server.go:328] "unable to start the controlplane" err="timeout waiting for process kube-apiserver to start" logger="controller-runtime.test-env" tries=4
      level=debug msg=Collecting applied cluster api manifests...
      level=error msg=failed to fetch Cluster: failed to generate asset "Cluster": failed to create cluster: failed to run cluster api system: failed to run local control plane: unable to start control plane itself: failed to start the controlplane. retried 5 times: timeout waiting for process kube-apiserver to start

      When i put my laptop in performance mode things succeeded. I also confirmed if i started up the etcd server manually (using the same command that the installer was using to launch it), the etcd port was initially only available intermittently before stabilizing.

      my impression/theory is that etcd was taking too long to come up/stabilize and the apiserver/installer gave up.

      Increasing the default timeout and making the timeout configurable in general would be helpful to provide a workaround for people who run into this situation.

              padillon Patrick Dillon
              bparees@redhat.com Ben Parees
              None
              None
              None
              None
              None
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: