Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-35719

Installer fail to scale up all etcd members(1 ready, 2 pending)

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • 4.16
    • kube-apiserver
    • None
    • No
    • Rejected
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      installer failed at waiting for bootstrap completed, one etcd pod is ready, and it pending for other 2 etcd pods:

      Thu 2024-06-13 14:06:42 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap kubelet.service[3150]: I0613 14:06:42.764345 3150 kubelet_getters.go:187] "Pod status updated" pod="openshift-etcd/etcd-bootstrap-member-ci-op-8wnbnpks-d45e3-7mjh5-bootstrap" status="Running"
      Thu 2024-06-13 14:06:42 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap kubelet.service[3150]: I0613 14:06:42.766724 3150 sysinfo.go:242] Found node without cache information, nodeDir: /sys/devices/system/node/node0
      Thu 2024-06-13 14:06:43 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap progress.service[3187]: Waiting for at least 2 available IP addresses for the default/kubernetes service
      Thu 2024-06-13 14:06:43 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap progress.service[3187]: Waiting for at least 2 available IP addresses for the default/kubernetes service
      Thu 2024-06-13 14:06:43 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap progress.service[3187]: Got the following addresses for the default/kubernetes endpoint object: 10.0.0.7
      Thu 2024-06-13 14:06:47 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap kubelet.service[3150]: I0613 14:06:47.408265 3150 kubelet_node_status.go:401] "Setting node annotation to enable volume controller attach/detach"
      .....................................
      .........
      Thu 2024-06-13 14:11:00 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap progress.service[3187]: The following error happened while retrieving the default/kubernetes endpoint object
      Thu 2024-06-13 14:11:00 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap progress.service[3187]: E0613 14:09:00.886699 7944 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ci-op-8wnbnpks-d45e3.qe.azure.devcluster.openshift.com:6443/api?timeout=32s": dial tcp 10.0.0.4:6443: i/o timeout
      Thu 2024-06-13 14:11:00 UTC ci-op-8wnbnpks-d45e3-7mjh5-bootstrap progress.service[3187]: E0613 14:09:30.888511 7944 memcache.go:265] couldn't get current server API group list: Get "https://api-int.ci-op-8wnbnpks-d45e3.qe.azure.devcluster.openshift.com:6443/api?timeout=32s": dial tcp 10.0.0.4:6443: i/o timeout

      all log here: https://drive.google.com/file/d/14JNhcgJOvCOs2XD1Ug3LL4UWS7ZZusgc/view?usp=drive_link

      Version-Release number of selected component (if applicable):
      4.16

      How reproducible:
      no always
      Steps to Reproduce:

      
      

      Installer failure, and it seems one etcd pod is ready, other 2 etcd pods pending on kubernetes service

      Actual results:
      installer fails

      Expected results:
      succeed to install all etcd member

      Additional info:

            Unassigned Unassigned
            rhn-support-geliu Ge Liu
            Ge Liu Ge Liu
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

              Created:
              Updated: