Red Hat OpenShift Data Science / RHODS-3191

Dashboard is causing high memory usage on openshift-kube-apiserver


    • Type: Bug
    • Resolution: Done
    • Priority: Major
    • RHODS_1.11.0_GA
    • RHODS_1.6.0_GA
    • UI
    • RHODS 1.11, RHODS 1.12
    • Important

      Description of problem:

      On clusters with very high resource counts, the ODH Dashboard is causing high memory usage on openshift-kube-apiserver. This creates memory pressure on the OpenShift master nodes where those pods reside, and can render the cluster unreachable if all available memory on a node is consumed.

      The performance test was the default toolchain-e2e test for the sandbox, which creates a large number of users and namespaces.

      https://github.com/codeready-toolchain/toolchain-e2e/tree/master/setup

      sum(container_memory_usage_bytes{namespace="openshift-kube-apiserver", pod=~"kube-apiserver-.*"}) amounted to 360 GB of RAM, and individual pods were over 60 GB.

      https://docs.google.com/spreadsheets/d/1RKt9jtTtv4Ft3ZlVk4sszC-BrF_0dbC46_FNHiLTaI0/edit#gid=0
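
      The figures above come from instant PromQL queries against the cluster's Prometheus. As a rough, hypothetical sketch (not part of the original test tooling), the per-pod breakdown behind the "individual pods were over 60 GB" observation can be pulled over the Prometheus HTTP API as follows; PROM_URL and PROM_TOKEN are assumed placeholders for the monitoring route and a bearer token with monitoring access:

      import os

      import requests

      # Assumed placeholders (not defined in this issue): the cluster Prometheus
      # route and a bearer token with monitoring access.
      PROM_URL = os.environ["PROM_URL"]
      TOKEN = os.environ["PROM_TOKEN"]

      # Per-pod breakdown of the metric that the sum above aggregates.
      QUERY = ('sum by (pod) (container_memory_usage_bytes{'
               'namespace="openshift-kube-apiserver", pod=~"kube-apiserver-.*"})')

      resp = requests.get(
          f"{PROM_URL}/api/v1/query",
          params={"query": QUERY},
          headers={"Authorization": f"Bearer {TOKEN}"},
          verify=False,  # many test clusters use a self-signed ingress certificate
      )
      resp.raise_for_status()

      # Each result entry is one kube-apiserver pod with its current memory usage.
      for sample in resp.json()["data"]["result"]:
          gib = float(sample["value"][1]) / 1024**3
          print(f'{sample["metric"]["pod"]}: {gib:.1f} GiB')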

      Prerequisites (if any, like setup, operators/versions):

      Steps to Reproduce:

      1. Create a cluster with m5.8xlarge master nodes (https://github.com/codeready-toolchain/toolchain-e2e/tree/master/setup#prereqs)
      2. Install RHODS
      3. Set up the sandbox operators (https://github.com/codeready-toolchain/toolchain-e2e/tree/master/setup#dev-sandbox-setup-1)
      4. Run the tests (https://github.com/codeready-toolchain/toolchain-e2e/tree/master/setup#provisioning-test-users-and-capturing-metrics)
        for 1 user and for 2000 users, and compare the results.
        Example:
        go run setup/main.go -users 2000 --default 2000 --custom 0 --username "user${RANDOM_NAME}" --workloads redhat-ods-operator:rhods-operator --workloads redhat-ods-applications:rhods-dashboard --workloads redhat-ods-operator:cloud-resource-operator --workloads redhat-ods-monitoring:blackbox-exporter --workloads redhat-ods-monitoring:grafana --workloads redhat-ods-monitoring:prometheus
      5. View the results (a minimal sketch for pulling the per-run numbers from Prometheus follows these steps)
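
      For step 5, the Average/Max figures reported under "Actual results" and "Expected results" below can be pulled from Prometheus after each run. A minimal sketch, assuming the same placeholder PROM_URL/PROM_TOKEN as in the description and an assumed 2h test window (adjust to the actual run duration); run it once after the 1-user run and once after the 2000-user run to get directly comparable numbers:

      import os

      import requests

      PROM_URL = os.environ["PROM_URL"]   # assumed placeholder: cluster Prometheus route
      TOKEN = os.environ["PROM_TOKEN"]    # assumed placeholder: token with monitoring access

      # Summed kube-apiserver container memory, as in the description above.
      SUM_EXPR = ('sum(container_memory_usage_bytes{namespace="openshift-kube-apiserver",'
                  ' pod=~"kube-apiserver-.*"})')

      def query(expr: str) -> float:
          """Run an instant PromQL query and return its single value in MB."""
          r = requests.get(
              f"{PROM_URL}/api/v1/query",
              params={"query": expr},
              headers={"Authorization": f"Bearer {TOKEN}"},
              verify=False,
          )
          r.raise_for_status()
          return float(r.json()["data"]["result"][0]["value"][1]) / 1024**2

      # Average and max of the summed metric over the (assumed) test window,
      # using PromQL subqueries sampled at 1m resolution.
      window = "2h"
      avg_mb = query(f"avg_over_time({SUM_EXPR}[{window}:1m])")
      max_mb = query(f"max_over_time({SUM_EXPR}[{window}:1m])")
      print(f"Average openshift-kube-apiserver: {avg_mb:.2f} MB")
      print(f"Max openshift-kube-apiserver: {max_mb:.2f} MB")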

      Actual results:

      Resulting memory usage for openshift-kube-apiserver (average and max) is very high (300+ GB):

      Average openshift-kube-apiserver: 418536.23 MB
      Max openshift-kube-apiserver: 418536.23 MB

      Usage with the same resources on the cluster before and after deploying the dashboard:

      Expected results:

      Resulting memory usage for openshift-kube-apiserver (average and max) should stay under 100 GB:

      Average openshift-kube-apiserver: 55374.12 MB
      Max openshift-kube-apiserver: 89703.79 MB

      Reproducibility (Always/Intermittent/Only Once):

      Build Details: v1.6.0-8 and 1.7.0-5

      Workaround: None

      Additional info:

              Jeffrey Phillips (jephilli@redhat.com)
              Chris Chase (cchase@redhat.com)
              Tarun Kumar
