Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.18
Component/s: Machine Config Operator
Labels:

Severity:
Important
Regression:
None
Story Points:
3
Sprint:
MCO Sprint 261, MCO Sprint 262
sprint_count:
2
Release Blocker:
Rejected
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Release Note Text:
N/A
Release Note Type:
Release Note Not Required
Release Note Status:
In Progress
Target Version:

4.18.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Description of problem:

The pinned images functionality is not working

Version-Release number of selected component (if applicable):

IPI on AWS version:
$ oc get clusterversion
NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.18.0-0.nightly-2024-10-28-052434   True        False         6h46m   Cluster version is 4.18.0-0.nightly-2024-10-28-052434

How reproducible:

Always

Steps to Reproduce:

    1. Enable techpreview
    2.  Create a pinnedimagesets resource

$ oc create -f - << EOF
apiVersion: machineconfiguration.openshift.io/v1alpha1
kind: PinnedImageSet
metadata:
  labels:
    machineconfiguration.openshift.io/role: worker
  name: tc-73623-worker-pinned-images
spec:
  pinnedImages:
  - name: "quay.io/openshifttest/busybox@sha256:0415f56ccc05526f2af5a7ae8654baec97d4a614f24736e8eef41a4591f08019"
  - name: quay.io/openshifttest/alpine@sha256:be92b18a369e989a6e86ac840b7f23ce0052467de551b064796d67280dfa06d5
EOF

Actual results:

The images are not pinned and the pool is degraded

We can see these logs in the MCDs

 I1028 14:26:32.514096    2341 pinned_image_set.go:304] Reconciling pinned image set: tc-73623-worker-pinned-images: generation: 1
E1028 14:26:32.514183    2341 pinned_image_set.go:240] failed to get image status for "quay.io/openshifttest/busybox@sha256:0415f56ccc05526f2af5a7ae8654baec97d4a614f24736e8eef41a4591f08019": rpc error: code = Unavailable desc = name resolver error: produced zero addresses

And we can see the machineconfignodes resources reporting pinnedimagesets degradation:

  - lastTransitionTime: "2024-10-28T14:27:58Z"
    message: 'failed to get image status for "quay.io/openshifttest/busybox@sha256:0415f56ccc05526f2af5a7ae8654baec97d4a614f24736e8eef41a4591f08019":
      rpc error: code = Unavailable desc = name resolver error: produced zero addresses'
    reason: PrefetchFailed
    status: "True"
    type: PinnedImageSetsDegraded

Expected results:

The images should be pinned without errors.

Additional info:


Slack conversation: https://redhat-internal.slack.com/archives/C02CZNQHGN8/p1730125766377509

This is Sam's guess (thank you [~rh-ee-sbatsche] for your quick help, I really appreciate it):
My guess is that it is related to https://github.com/openshift/machine-config-operator/pull/4629
Specifically the changes to pkg/daemon/cri/cri.go where we swapped out DialContext for NewClient. Per docs.
One subtle difference between NewClient and Dial and DialContext is that the former uses "dns" as the default name resolver, while the latter use "passthrough" for backward compatibility. This distinction should not matter to most users, but could matter to legacy users that specify a custom dialer and expect it to receive the target string directly.

relates to

MCO-1285 Update MCO dependencies to Kubernetes 1.31

Closed

MCO-1286 pre-merge testing: Update MCO dependencies to Kubernetes 1.31

Closed

links to

openshift/machine-config-operator#4668: OCPBUGS-43893: Pinned images not working

RHEA-2024:6122 OpenShift Container Platform 4.18.z bug fix update

Assignee:: David Joshy

Reporter:: Sergio Regidor de la Rosa

QA Contact:: Sergio Regidor de la Rosa

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2024/10/28 4:22 PM

Updated:: 2025/02/27 2:49 PM

Resolved:: 2025/02/25 4:50 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates