Loading...

XML

Word

Printable

Type: Bug
Resolution: Won't Do
Priority: Normal
Fix Version/s: 4.21.0
Affects Version/s: 4.15.0
Component/s: kube-apiserver
Labels:
- api
- rits-work

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Low
Regression:
No

Target Backport Versions:
None
Target Version:

4.21.0
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Priority Data:
PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

The alert KubeAPIDown is not reflecting what really the alert means. 

In the summary, it's indicated that 

  "Target disappeared from Prometheus target discovery"

but really, the only values are 0 or 1 for this alert, then, if you bring down a kubeapi server, this alert is not triggered.

Version-Release number of selected component (if applicable):

4.13 and previous

How reproducible:

Always. Shutting down a master node

Steps to Reproduce:

1. Shutdown a master node
2. Wait and check the alert KubeAPIDown never is triggered since it's 1 or 2 and 2 apiserver are up.

Actual results:

The alert is not reflecting the summary:

  "Target disappeared from Prometheus target discovery"

Expected results:

The alert is triggered if we respect the summary and the alertname. Then 2 possible options that they could go together:

1. Create an alert that reflects the status of the kubeapi server pods. Then, a apiserver pod is down, it's reflected as a degraded status
2. Modify the current alert to the name summary to reflect the really monitored

Additional info: