Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.12
Component/s: Compliance Operator
Labels:
- ISC_PXE

Regression: None
Story Points: 2
Sprint: CMP Sprint 61
sprint_count: 1
Blocked: False
Blocked Reason: None
Target Version: 4.14.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Description of problem:

The compliance_operator_compliance_scan_error_total metric has an "error" label whose value is the error message, so every distinct message creates a new time series. This goes against good instrumentation practices because:
1) The cardinality of the metric cannot be predicted (i.e. how many label key/value combinations can exist at the same time for compliance_operator_compliance_scan_error_total). Unbounded metrics like this can put a heavy memory load on Prometheus.
2) The label value can be very long, which causes problems when users want to push the metric to other systems (see https://issues.redhat.com/browse/OBSDA-205).
In practice, alerting on the metric is also likely to be complicated.
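
To make the cardinality concern concrete, here is a minimal sketch that counts time series the way Prometheus identifies them: one series per unique set of label values. It does not use the operator's code or a Prometheus client library. The "name" label, the scan name, and the error messages are all hypothetical and chosen only for illustration.

```python
from collections import Counter

def record(series, **labels):
    """Increment the counter for one label set (one label set == one series)."""
    series[tuple(sorted(labels.items()))] += 1

# Hypothetical error messages: each one embeds per-occurrence detail,
# so no two messages are identical.
messages = [f"scanner pod scanner-{i} failed after {30 + i}s" for i in range(100)]

with_error_label = Counter()     # metric carrying an "error" label
without_error_label = Counter()  # same metric with the label removed

for msg in messages:
    # "name" is an assumed label, used here only as a stand-in.
    record(with_error_label, name="example-scan", error=msg)
    record(without_error_label, name="example-scan")

series_with = len(with_error_label)
series_without = len(without_error_label)
print(series_with, series_without)  # prints "100 1"
```

With the label, 100 failures produce 100 series that each hold the value 1. Without it, the same failures produce a single series whose value is 100, which is also the shape that a simple threshold alert expects.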

Version-Release number of selected component (if applicable):

4.12 (but probably applies to previous versions too)

How reproducible:

Always

Steps to Reproduce:

1. Trigger a scan that will fail.
2. Go to OCP console > metrics page and query "compliance_operator_compliance_scan_error_total".
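
The same check can be scripted instead of using the console. The sketch below assumes a Prometheus-compatible endpoint that serves the standard `/api/v1/query` HTTP API. The base URL is a placeholder and the sample response is hand-written for illustration, not captured from a cluster. Authentication and the actual HTTP request are left out.

```python
import json
from typing import List
from urllib.parse import urlencode

METRIC = "compliance_operator_compliance_scan_error_total"

def build_query_url(base_url: str, query: str = METRIC) -> str:
    """Return the instant-query URL for the Prometheus HTTP API."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': query})}"

def error_label_values(response_body: str) -> List[str]:
    """Extract the distinct values of the "error" label from a query response."""
    payload = json.loads(response_body)
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    values = {
        sample["metric"]["error"]
        for sample in payload["data"]["result"]
        if "error" in sample["metric"]
    }
    return sorted(values)

# Hand-written sample in the API's instant-vector format (hypothetical data).
sample_response = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"__name__": METRIC, "error": "timed out waiting for results"},
             "value": [1664371560, "1"]},
            {"metric": {"__name__": METRIC, "error": "failed to launch scanner pod"},
             "value": [1664371560, "1"]},
        ],
    },
})

url = build_query_url("https://prometheus.example.com/")  # placeholder host
found = error_label_values(sample_response)
print(url)
print(found)
```

A non-empty list from `error_label_values` reproduces the problem; an empty list means no returned series carries the "error" label.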

Actual results:

The compliance_operator_compliance_scan_error_total metric is returned with an "error" label containing the error message.

Expected results:

No "error" label.

Additional info:

https://prometheus.io/docs/practices/naming/#labels
https://github.com/openshift/compliance-operator/blob/master/doc/usage.md#metrics
https://issues.redhat.com/browse/OBSDA-205

is triggered by

OBSDA-205 [FEATURE] allow to configure enforced limits on PrometheusK8sConfig

Closed

links to

ComplianceAsCode/compliance-operator#223: OCPBUGS-1803: Remove compliance_operator_compliance_scan_error_total …

mentioned on

Merge request - Updated US source to: c922d65 Merge branch 'release-v1.1.0' into ocp-0.1

Merge request - Updated US source to: c278069 Release v1.0.0

Assignee: Lance Bragstad

Reporter: Simon Pasquier

QA Contact: Xiaojie Yuan

Votes: 0

Watchers: 6

Created: 2022/09/28 1:26 PM

Updated: 2023/06/09 7:06 AM

Resolved: 2023/04/17 2:11 PM
