Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-50859

Loss of etcd quorum and leadership exceeds 1 minute

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • 4.19.0
    • 4.18.0, 4.19.0
    • Etcd
    • None

      In order to reproduce the issue in https://issues.redhat.com/browse/OCPBUGS-48400, I implemented an e2e monitoring test in https://issues.redhat.com/browse/CNTRLPLANE-195.

      In one of the ci runs, the test has failed

      Run #0: Failed expand_less	0s
      {  
      etcd cluster did not have a leader for 1m0.641024204s
      Feb 14 03:01:00.938 - 1s    W node/ip-10-0-53-48.ec2.internal etcd-member/bdf6573689c309ec constructed/etcd-lifecycle-constructor leader/bdf6573689c309ec term/7
      Feb 14 03:02:02.580 - 50s   W node/ip-10-0-53-48.ec2.internal etcd-member/bdf6573689c309ec constructed/etcd-lifecycle-constructor leader/bdf6573689c309ec term/7}
      

      However, this test failure is after cluster bootstrap has been finished. However, this bus is to discover the root cause of this failure.

              melbeher@redhat.com Mustafa Elbehery
              melbeher@redhat.com Mustafa Elbehery
              Ge Liu Ge Liu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated: