
Type: Bug
Resolution: Won't Do
Fix Version/s: None
Affects Version/s: 4.11
Component/s: Multi-Arch
Labels: None

Activity Type: Quality / Stability / Reliability
Blocked: False
Blocked Reason: None
Story Points: None
Severity: None

Target Version: None
Release Blocker: None
Sprint: None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status: None
Release Note Type: None
Release Note Text: None

Note:
1. If you are dealing with Machines or MachineSet objects, please select the component "Cloud Compute" under the same product.
2. If you are dealing with kubelet / kubeletconfigs / container runtime configs, please select the component "Node" under the same product.

Description of problem:
After deploying OCP on a ppc64le environment, I ran:
[root@rdr-sri-7cfc-tok04-bastion-0 ~]# oc get MachineConfigPool worker
NAME     CONFIG                                             UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
worker   rendered-worker-627087f67ace055b92ca4c26488fdeb7   False     True       False      3              2                   2                     0                      18h

I expected READYMACHINECOUNT to be 3, but it is 2.

Then I stopped kubelet.service on one of the worker nodes. The counts became:

[root@rdr-sri-7cfc-tok04-bastion-0 ~]# oc get MachineConfigPool worker
NAME     CONFIG                                             UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
worker   rendered-worker-627087f67ace055b92ca4c26488fdeb7   False     True       False      3              1                   2                     0                      18h

READYMACHINECOUNT dropped to 1. When I started the service again, the count returned to 2.
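
The comparison made above by eye (READYMACHINECOUNT against MACHINECOUNT) can be sketched as a small shell filter. This is only an illustration, not part of the original report: the function name mcp_not_ready is hypothetical, and it assumes the ten-column table layout shown in the outputs above, with every column populated.

```shell
# Hypothetical helper (not an oc subcommand): reads `oc get mcp` table output
# on stdin and prints each pool whose READYMACHINECOUNT (column 7) is lower
# than its MACHINECOUNT (column 6). Assumes the ten-column layout shown above
# with no empty columns; a pool with an empty CONFIG would shift the fields.
mcp_not_ready() {
  awk 'NR > 1 && ($7 + 0) < ($6 + 0) {
    printf "%s: %d of %d machines ready\n", $1, $7, $6
  }'
}

# Usage against a live cluster (requires oc and cluster access):
#   oc get machineconfigpool | mcp_not_ready
```

For a pool that the filter flags, oc describe mcp/worker gives the pool's conditions and status in more detail.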

Version-Release number of MCO (Machine Config Operator) (if applicable): 4.11

Platform (AWS, VSphere, Metal, etc.): IBM Power

Are you certain that the root cause of the issue being reported is the MCO (Machine Config Operator)?
(Y/N/Not sure): Yes, as I am looking at the MachineConfigPool.

How reproducible:

Did you catch this issue by running a Jenkins job? If yes, please list:
1. Jenkins job:

2. Profile:

Steps to Reproduce:
1. As described above.
2.
3.

Actual results:

Expected results:

Additional info:

1. Please consider attaching a must-gather archive (via oc adm must-gather). Please review must-gather contents for sensitive information before attaching any must-gathers to a Bugzilla report. You may also mark the bug private if you wish.

2. If a must-gather is unavailable, please provide the output of:

$ oc get co machine-config -o yaml

$ oc get mcp (and oc describe mcp/${degraded_pool} if pools are degraded)

$ oc get mc

$ oc get pod -n openshift-machine-config-operator

$ oc get node -o wide

3. If a node is not accessible via API, please provide console/journal/kubelet logs of the problematic node

4. Are there RHEL nodes on the cluster? If yes, please upload the whole Ansible logs or Jenkins job
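
When a must-gather is unavailable, the commands listed in item 2 can be run in one pass and saved to files for attaching. The following is a minimal sketch under stated assumptions: the function name collect_mco_info, the default directory name, and the output file names are all hypothetical, and oc must already be logged in to the cluster.

```shell
# Hypothetical helper: saves the outputs requested in item 2 into a directory.
# The function name, directory, and file names are illustrative assumptions.
collect_mco_info() {
  outdir="${1:-mco-info}"
  mkdir -p "$outdir" || return 1
  oc get co machine-config -o yaml > "$outdir/co-machine-config.yaml"
  oc get mcp > "$outdir/mcp.txt"
  oc get mc > "$outdir/mc.txt"
  oc get pod -n openshift-machine-config-operator > "$outdir/mco-pods.txt"
  oc get node -o wide > "$outdir/nodes.txt"
}

# Usage (requires oc and cluster access):
#   collect_mco_info ./mco-info
```

As with a must-gather archive, review the collected files for sensitive information before attaching them.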

external trackers

Red Hat Issue Tracker MULTIARCH-2667

Assignee: Jeremy Poulin

Reporter: Sridhar Venkat (Inactive)

Contributors: None

Architect: None

QA Contact: None

Doc Contact: None

Votes: 0

Watchers: 2

Created: 2022/06/27 2:14 PM

Updated: 2025/07/29 5:46 PM

Resolved: 2022/07/01 2:45 PM
