Type: Bug
Resolution: Done
Priority: Critical
Fix Version/s: odf-4.19.9
Affects Version/s: odf-4.17
Component/s: ocs-operator
Labels:
None

Blocked:
False
Blocked Reason:
None
Ready:
False
Bugzilla Bug:
RHBZ: 2314998
Current Status:
On Track
Dev Approval:
Committed
Prod build version:
4.19.9-1.konflux
QE Approval:
Committed
Release Note Type:
Release Note Not Required
Target Release:

odf-4.19.9
Git Pull Request:
https://github.com/red-hat-storage/ocs-operator/pull/2892, https://github.com/red-hat-storage/ocs-operator/pull/2823
Intelligence Requested:
Market:

Target Version:

odf-4.19.9

Regression:
None

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Description of problem (please be as detailed as possible and provide log
snippets):
While running the test test_mds_cache_alert_with_active_node_drain, we ran metadata I/O against CephFS with the following steps:
1. Create a PVC with CephFS, access mode RWX
2. Create a DC pod with a Fedora image
3. Copy helper_scripts/meta_data_io.py to the Fedora DC pod
4. Run meta_data_io.py on the Fedora pod
The script is available at https://github.com/red-hat-storage/ocs-ci/blob/e4bcbb284280862d03b7f6b5ab2b40e2727482f3/ocs_ci/templates/workloads/helper_scripts/meta_data_io.py

This script triggers high MDS cache usage when the standby-replay MDS is scaled down, but does not trigger it when the active MDS node is drained, which suggests the problem is related to disruption of the active MDS node.
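
For orientation, a metadata-heavy workload of the kind described above can be sketched as follows. This is not the contents of meta_data_io.py; the function name `metadata_churn`, the file naming, and the mix of operations are assumptions made purely for illustration.

```python
import os
import tempfile


def metadata_churn(root, n_files=100):
    """Create, stat, rename, and delete empty files under `root`.

    Hypothetical helper: each file costs four metadata operations and
    writes no data. Returns the number of operations issued.
    """
    ops = 0
    for i in range(n_files):
        src = os.path.join(root, f"f{i}")
        dst = os.path.join(root, f"g{i}")
        with open(src, "w"):
            pass  # create an empty file (metadata only)
        ops += 1
        os.stat(src)
        ops += 1
        os.rename(src, dst)
        ops += 1
        os.remove(dst)
        ops += 1
    return ops


if __name__ == "__main__":
    # On the test pod the root would be the CephFS mount point; a
    # temporary directory is used here so the sketch runs anywhere.
    with tempfile.TemporaryDirectory() as workdir:
        print("metadata operations issued:", metadata_churn(workdir, 10))
```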

Version of all relevant components (if applicable):
OC version:
Client Version: 4.16.11
Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
Server Version: 4.16.12
Kubernetes Version: v1.29.8+f10c92d

OCS version:
ocs-operator.v4.16.2-rhodf OpenShift Container Storage 4.16.2-rhodf ocs-operator.v4.16.1-rhodf Succeeded

ODF operator full version: 4.16.2-4

Cluster version:
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version 4.16.12 True False 12h Error while reconciling 4.16.12: the cluster operator insights is not available

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
Potentially

Is there any workaround available to the best of your knowledge?
no

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
3

Is this issue reproducible?
1/1

Can this issue reproduce from the UI?
no

If this is a regression, please provide more details to justify this:
new deployment. Tech preview

Steps to Reproduce:
1. Deploy a ROSA HCP cluster with ODF and run test_mds_cache_alert_with_active_node_drain

Actual results:
The MDSCacheUsageHigh alert was not found.

Expected results:
The MDSCacheUsageHigh alert fires when its conditions are met.
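
A minimal sketch of how the expected result could be checked, assuming the alerts are read from a Prometheus-compatible `/api/v1/alerts` response. The helper name `firing_alerts` is hypothetical, and the sample payload is synthetic, not captured from the affected cluster.

```python
import json


def firing_alerts(payload, alertname):
    """Return the alerts named `alertname` that are in the 'firing' state."""
    alerts = payload.get("data", {}).get("alerts", [])
    return [
        a for a in alerts
        if a.get("labels", {}).get("alertname") == alertname
        and a.get("state") == "firing"
    ]


# Synthetic payload for illustration only (not taken from the cluster).
sample = json.loads("""
{"status": "success",
 "data": {"alerts": [
   {"labels": {"alertname": "ExampleOtherAlert"}, "state": "firing"},
   {"labels": {"alertname": "MDSCacheUsageHigh"}, "state": "pending"}
 ]}}
""")

found = firing_alerts(sample, "MDSCacheUsageHigh")
print("MDSCacheUsageHigh firing:", bool(found))
```

The filter requires the `firing` state, so an alert that is only `pending` does not count as fired.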

Additional info:
A cluster for capturing the necessary data will be created upon request to QE.

is cloned by

DFBUGS-1474 [2314998] [ODF on ROSA HCP] [4.17] MDSCacheUsageHigh not found with active node drained

Closed

links to

DFBUGS-370: [release-4.19] Create ClusterRoleBinding to fetch in-cluster monitoring metrics

red-hat-storage/ocs-operator#2892: Create ClusterRoleBinding to fetch in-cluster monitoring metrics

Assignee: Kaustav Majumder

Reporter: Daniel Osypenko

Need Info From: Kaustav Majumder

QA Contact: Daniel Osypenko

Votes: 0

Watchers: 16

Created: 2024/09/26 8:06 PM

Updated: 2025/12/22 2:32 PM

Resolved: 2025/12/22 2:32 PM
