Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-54839

Testcase failure on PowerVS CI runs - [sig-etcd] etcd should not log excessive took too long messages

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Cannot Reproduce
    • Icon: Undefined Undefined
    • None
    • 4.18, 4.19
    • Etcd
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      Observing the below etcd  testfailure in 4.18 and 4.19 PowerVS runs

      1. [sig-etcd] etcd should not log excessive took too long messages

      { Etcd logged 10493 'took too long' messages, this test fails on any value over 10000 as this is a strong indicator that etcd was very unhealthy throughout the run. This can cause sparodic e2e failures and disruption and typically indicates faster disks are needed. These log message intervals are included in spyglass chart artifacts and can be used to correlate with disruption and failed tests.}

      2. [bz-etcd][invariant] alert/etcdMemberCommunicationSlow should not be at or above info

      { etcdMemberCommunicationSlow was at or above info for at least 2m22s on platformidentification.JobType{Release:"4.19", FromRelease:"", Platform:"", Architecture:"ppc64le", Network:"ovn", Topology:"ha"} (maxAllowed=0s): pending for 9m30s, firing for 2m22s: Apr 10 00:51:38.876 - 58s W namespace/openshift-etcd node/192.168.124.10:9979 pod/etcd-p-lon06-1-capi-419-7mdjj-master-0 alert/etcdMemberCommunicationSlow alertstate/firing severity/warning ALERTS{To="890f003e4491e681", alertname="etcdMemberCommunicationSlow", alertstate="firing", endpoint="etcd-metrics", instance="192.168.124.10:9979", job="etcd", namespace="openshift-etcd", pod="etcd-p-lon06-1-capi-419-7mdjj-master-0", prometheus="openshift-monitoring/k8s", service="etcd", severity="warning"} Apr 10 00:52:08.876 - 28s W namespace/openshift-etcd node/192.168.124.11:9979 pod/etcd-p-lon06-1-capi-419-7mdjj-master-1 alert/etcdMemberCommunicationSlow alertstate/firing severity/warning ALERTS{To="28d2c243d792bfce", alertname="etcdMemberCommunicationSlow", alertstate="firing", endpoint="etcd-metrics", instance="192.168.124.11:9979", job="etcd", namespace="openshift-etcd", pod="etcd-p-lon06-1-capi-419-7mdjj-master-1", prometheus="openshift-monitoring/k8s", service="etcd", severity="warning"} Apr 10 00:52:08.876 - 28s W namespace/openshift-etcd node/192.168.124.12:9979 pod/etcd-p-lon06-1-capi-419-7mdjj-master-2 alert/etcdMemberCommunicationSlow alertstate/firing severity/warning ALERTS{To="890f003e4491e681", alertname="etcdMemberCommunicationSlow", alertstate="firing", endpoint="etcd-metrics", instance="192.168.124.12:9979", job="etcd", namespace="openshift-etcd", pod="etcd-p-lon06-1-capi-419-7mdjj-master-2", prometheus="openshift-monitoring/k8s", service="etcd", severity="warning"} Apr 10 00:52:08.876 - 28s W namespace/openshift-etcd node/192.168.124.11:9979 pod/etcd-p-lon06-1-capi-419-7mdjj-master-1 alert/etcdMemberCommunicationSlow alertstate/firing severity/warning ALERTS{To="dbe386c0426da004", alertname="etcdMemberCommunicationSlow", alertstate="firing", endpoint="etcd-metrics", instance="192.168.124.11:9979", job="etcd", namespace="openshift-etcd", pod="etcd-p-lon06-1-capi-419-7mdjj-master-1", prometheus="openshift-monitoring/k8s", service="etcd", severity="warning"}}

      Joblink : 4.184.19

       

              mturek.coreos Michael Turek
              karumuga28 Keerthana Arumugam
              None
              None
              Ge Liu Ge Liu
              None
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: