-
Story
-
Resolution: Done
-
Blocker
-
None
-
None
-
False
-
None
-
False
-
-
Began sometime today or late yesterday, affects multiple clouds and is blocking payloads with most jobs failing.
[sig-instrumentation] Prometheus [apigroup:image.openshift.io] when installed on the cluster shouldn't report any alerts in firing state apart from Watchdog and AlertmanagerReceiversNotConfigured [Early][apigroup:config.openshift.io] [Skipped:Disconnected] [Suite:openshift/conformance/parallel] expand_less Run #0: Failed expand_less 1m0s { "service": "machine-config-daemon", "severity": "warning" }, "value": [ 1689134505.447, "1" ] }, { "metric": { "__name__": "ALERTS", "alertname": "KubeletHealthState", "alertstate": "firing", "container": "oauth-proxy", "endpoint": "metrics", "instance": "10.0.0.8:9001", "job": "machine-config-daemon", "namespace": "openshift-machine-config-operator", "node": "ci-op-fbh5bhvb-ed2ea-xdsvq-master-1", "pod": "machine-config-daemon-f5wmt", "prometheus": "openshift-monitoring/k8s", "service": "machine-config-daemon", "severity": "warning" }, "value": [ 1689134505.447, "1" ] },
Being discussed here: https://redhat-internal.slack.com/archives/C01CQA76KMX/p1689169944865979
At present we can find no changes in this payload that weren't in previous that did not exhibit the issue.
The new rhcos version seems fine in ci payloads.
Problem began surfacing for MCO in their presubmits a few days prior.
- depends on
-
OCPBUGS-16128 4.13/4.14 MCDs do not work with FIPS enabled golang builders
- Closed
- links to