Bug
Resolution: Unresolved
Normal
4.10.z
Quality / Stability / Reliability
MON Sprint 278
In Progress
Bug Fix
Description of problem:
KubeAggregatedAPIErrors was alerting on the total number of errors returned by all instances of an aggregated API, which made it more likely to fire the more instances the API had. To fix that, the alert needs to evaluate errors at the instance level rather than at the API level, by changing the aggregation function from a sum to a max.
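The difference between the two aggregations can be illustrated with a small sketch. It does not reproduce the actual alert rule: the threshold and the per-instance error counts are hypothetical values chosen for illustration, and the function names are made up for this example.

```python
# Minimal sketch of sum vs. max aggregation across API instances.
# THRESHOLD and all error counts below are hypothetical, not taken
# from the real KubeAggregatedAPIErrors rule.
THRESHOLD = 4


def fires_with_sum(errors_per_instance, threshold=THRESHOLD):
    """API-level check: add up the errors of every instance."""
    return sum(errors_per_instance) > threshold


def fires_with_max(errors_per_instance, threshold=THRESHOLD):
    """Instance-level check: look only at the worst instance."""
    return max(errors_per_instance, default=0) > threshold


if __name__ == "__main__":
    single = [2]            # one instance with a few errors
    replicated = [2, 2, 2]  # same per-instance error count, three instances
    one_bad = [0, 6, 0]     # one instance that is actually unhealthy

    for name, counts in [("single", single),
                         ("replicated", replicated),
                         ("one_bad", one_bad)]:
        print(name, "sum:", fires_with_sum(counts),
              "max:", fires_with_max(counts))
```

With the sum, the replicated case crosses the threshold even though no individual instance is worse than in the single-instance case. With the max, only the case with a genuinely unhealthy instance fires.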
Version-Release number of selected component (if applicable):
How reproducible:
Steps to Reproduce:
1. 2. 3.
Actual results:
Expected results:
Additional info:
Setting the affected version to 4.10.z, since that's as far back as it goes, but this first surfaced in 4.1 (Feb 2020): https://github.com/dgrisonnet/kubernetes-mixin/commit/966ce6f2a8ce7ceedad32a68e991d13d4ee8474e
- clones: OCPBUGS-60691 Improve KubeAggregatedAPIErrors alert in high availability scenarios (Verified)
- depends on: OCPBUGS-35095 `KubeCPUOvercommit` Alert Not Triggered Despite Node CPU is Overcommitment (Verified)
- links to