- Bug
- Resolution: Done
- Major
job link
log snippet from e2e log:
[It] alert/etcdHighNumberOfFailedGRPCRequests should not be at or above info [Suite:openshift/conformance/parallel]
  github.com/openshift/origin/test/extended/prometheus/alerts.go:23
Apr 17 14:16:52.778: FAIL: etcdHighNumberOfFailedGRPCRequests was at or above info for at least 59s on platformidentification.JobType{Release:"4.11", FromRelease:"", Platform:"azure", Architecture:"amd64", Network:"ovn", Topology:"ha"} (maxAllowed=3s): pending for 28m54s, firing for 59s:

Apr 17 13:41:42.000 - 59s W alert/etcdHighNumberOfFailedGRPCRequests node/10.0.0.6:9979 ns/openshift-etcd pod/etcd-ci-op-sxcpdb36-5cb9e-xmzh5-master-1 ALERTS{alertname="etcdHighNumberOfFailedGRPCRequests", alertstate="firing", endpoint="etcd-metrics", grpc_method="Watch", grpc_service="etcdserverpb.Watch", instance="10.0.0.6:9979", job="etcd", namespace="openshift-etcd", pod="etcd-ci-op-sxcpdb36-5cb9e-xmzh5-master-1", prometheus="openshift-monitoring/k8s", service="etcd", severity="warning"}

Full Stack Trace
github.com/openshift/origin/test/extended/prometheus.init.0.func1.1()
    github.com/openshift/origin/test/extended/prometheus/alerts.go:24 +0x96
github.com/onsi/ginkgo/internal/leafnodes.(*runner).runSync(0xc0000001a0)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/leafnodes/runner.go:113 +0xba
github.com/onsi/ginkgo/internal/leafnodes.(*runner).run(0xc003354ea0)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/leafnodes/runner.go:64 +0x125
github.com/onsi/ginkgo/internal/leafnodes.(*ItNode).Run(0x7f2f6cb48fff)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/leafnodes/it_node.go:26 +0x7b
github.com/onsi/ginkgo/internal/spec.(*Spec).runSample(0xc002b98690, 0xc003355268, {0x8c6fa00, 0xc000466c00})
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/spec/spec.go:215 +0x2a9
github.com/onsi/ginkgo/internal/spec.(*Spec).Run(0xc002b98690, {0x8c6fa00, 0xc000466c00})
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/spec/spec.go:138 +0xe7
github.com/onsi/ginkgo/internal/specrunner.(*SpecRunner).runSpec(0xc003536140, 0xc002b98690)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/specrunner/spec_runner.go:200 +0xe5
github.com/onsi/ginkgo/internal/specrunner.(*SpecRunner).runSpecs(0xc003536140)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/specrunner/spec_runner.go:170 +0x1a5
github.com/onsi/ginkgo/internal/specrunner.(*SpecRunner).Run(0xc003536140)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/specrunner/spec_runner.go:66 +0xc5
github.com/onsi/ginkgo/internal/suite.(*Suite).Run(0xc00047e000, {0x8c6fd20, 0xc0027219f0}, {0x0, 0x25f401b}, {0xc00273c4e0, 0x1, 0x1}, {0x8d84ff8, 0xc000466c00}, ...)
    github.com/onsi/ginkgo@v4.7.0-origin.0+incompatible/internal/suite/suite.go:62 +0x4b2
github.com/openshift/origin/pkg/test/ginkgo.(*TestOptions).Run(0xc00232bda0, {0xc000858030, 0xc6407f0, 0x484f3a0})
    github.com/openshift/origin/pkg/test/ginkgo/cmd_runtest.go:61 +0x3be
main.newRunTestCommand.func1.1()
    github.com/openshift/origin/cmd/openshift-tests/openshift-tests.go:434 +0x32
github.com/openshift/origin/test/extended/util.WithCleanup(0xc00231fc18)
    github.com/openshift/origin/test/extended/util/test.go:168 +0xad
main.newRunTestCommand.func1(0xc00071a280, {0xc000858030, 0x1, 0x1})
    github.com/openshift/origin/cmd/openshift-tests/openshift-tests.go:434 +0x349
github.com/spf13/cobra.(*Command).execute(0xc00071a280, {0xc0015ddf90, 0x1, 0x1})
    github.com/spf13/cobra@v1.2.1/command.go:856 +0x60e
github.com/spf13/cobra.(*Command).ExecuteC(0xc000785400)
    github.com/spf13/cobra@v1.2.1/command.go:974 +0x3bc
github.com/spf13/cobra.(*Command).Execute(...)
    github.com/spf13/cobra@v1.2.1/command.go:902
main.main.func1(0xc000556500)
    github.com/openshift/origin/cmd/openshift-tests/openshift-tests.go:84 +0x8a
main.main()
    github.com/openshift/origin/cmd/openshift-tests/openshift-tests.go:85 +0x3b6

[AfterEach] [sig-arch][bz-etcd][Late] Alerts
  github.com/openshift/origin/test/extended/util/client.go:151
[AfterEach] [sig-arch][bz-etcd][Late] Alerts
  github.com/openshift/origin/test/extended/util/client.go:152

fail [github.com/openshift/origin/test/extended/prometheus/alerts.go:24]: Apr 17 14:16:52.778: etcdHighNumberOfFailedGRPCRequests was at or above info for at least 59s on platformidentification.JobType{Release:"4.11", FromRelease:"", Platform:"azure", Architecture:"amd64", Network:"ovn", Topology:"ha"} (maxAllowed=3s): pending for 28m54s, firing for 59s:

Apr 17 13:41:42.000 - 59s W alert/etcdHighNumberOfFailedGRPCRequests node/10.0.0.6:9979 ns/openshift-etcd pod/etcd-ci-op-sxcpdb36-5cb9e-xmzh5-master-1 ALERTS{alertname="etcdHighNumberOfFailedGRPCRequests", alertstate="firing", endpoint="etcd-metrics", grpc_method="Watch", grpc_service="etcdserverpb.Watch", instance="10.0.0.6:9979", job="etcd", namespace="openshift-etcd", pod="etcd-ci-op-sxcpdb36-5cb9e-xmzh5-master-1", prometheus="openshift-monitoring/k8s", service="etcd", severity="warning"}

failed: (2.3s) 2022-04-17T14:16:52 "[sig-arch][bz-etcd][Late] Alerts alert/etcdHighNumberOfFailedGRPCRequests should not be at or above info [Suite:openshift/conformance/parallel]"
link to this job's testgrid for reference.
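
For triage, the key numbers in the failure line are the firing time (59s) and the allowed budget (maxAllowed=3s). Below is a rough Python sketch for pulling those values out of a failure line like the one in the log snippet. It is not part of openshift/origin; the helper names `parse_duration` and `summarize_failure` are hypothetical, and the parser assumes durations only contain whole `h`, `m`, and `s` components, as they do in this log.

```python
import re

# Failure line copied from the e2e log snippet in this ticket.
FAILURE = (
    'etcdHighNumberOfFailedGRPCRequests was at or above info for at least 59s '
    'on platformidentification.JobType{Release:"4.11", FromRelease:"", '
    'Platform:"azure", Architecture:"amd64", Network:"ovn", Topology:"ha"} '
    '(maxAllowed=3s): pending for 28m54s, firing for 59s:'
)

_UNIT_SECONDS = {"h": 3600, "m": 60, "s": 1}

# Hypothetical pattern, written against the message format seen in this log only.
_FAILURE_RE = re.compile(
    r"(?P<alert>\w+) was at or above (?P<level>\w+) for at least \S+ on .*?"
    r"\(maxAllowed=(?P<max>\S+?)\): pending for (?P<pending>\S+), "
    r"firing for (?P<firing>[^:\s]+)"
)


def parse_duration(text):
    """Convert a duration such as '28m54s' to whole seconds.

    Assumption: only integer h/m/s components appear (no 'ms' or fractions).
    """
    parts = re.findall(r"(\d+)([hms])", text)
    return sum(int(value) * _UNIT_SECONDS[unit] for value, unit in parts)


def summarize_failure(line):
    """Return the alert name and timing budget from a failure line, or None."""
    match = _FAILURE_RE.search(line)
    if match is None:
        return None
    firing = parse_duration(match["firing"])
    allowed = parse_duration(match["max"])
    return {
        "alert": match["alert"],
        "level": match["level"],
        "pending_seconds": parse_duration(match["pending"]),
        "firing_seconds": firing,
        "max_allowed_seconds": allowed,
        "exceeded": firing > allowed,
    }


if __name__ == "__main__":
    print(summarize_failure(FAILURE))
```

Running this over the failure lines from several jobs would give a quick view of how far each run exceeded the budget, which may help when comparing against the duplicate linked below.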
- duplicates
- SDN-3060 etcdHighNumberOfLeaderChanges alert firing for extended period
- Closed