-
Bug
-
Resolution: Duplicate
-
Undefined
-
None
-
4.18.z
-
None
-
None
-
False
-
-
None
-
None
-
None
-
None
-
None
-
Rejected
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
Description of problem:
Redhat-operator pods are failing regularly due to startup probe timing with the following events:
1h17m Normal Pulled pod/redhat-operators-vhr9q Successfully pulled image "registry.redhat.io/redhat/redhat-operator-index:v4.18" in 4.205s (4.205s including waiting). Image size: 1706887383 bytes.
1h17m Normal Created pod/redhat-operators-vhr9q Created container: extract-content
1h17m Normal Started pod/redhat-operators-vhr9q Started container extract-content
1h17m Normal Pulling pod/redhat-operators-vhr9q Pulling image "quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:a2ed3a56ac3e411dffa5a6d960e8ab570b62cc00a560c485d3eb5c4eb34c9cc5"
1h17m Normal Pulled pod/redhat-operators-vhr9q Successfully pulled image "quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:a2ed3a56ac3e411dffa5a6d960e8ab570b62cc00a560c485d3eb5c4eb34c9cc5" in 2.41s (2.41s including waiting). Image size: 911296197 bytes.
1h17m Normal Created pod/redhat-operators-vhr9q Created container: registry-server
1h17m Normal Started pod/redhat-operators-vhr9q Started container registry-server
1h17m Warning Unhealthy pod/redhat-operators-vhr9q Startup probe failed: timeout: failed to connect service ":50051" within 1s
1h17m Normal Killing pod/redhat-operators-vhr9q Stopping container registry-server
19m Normal Pulling pod/redhat-operators-x2xq7 Pulling image "quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:a2ed3a56ac3e411dffa5a6d960e8ab570b62cc00a560c485d3eb5c4eb34c9cc5"
19m Normal Created pod/redhat-operators-x2xq7 Created container: registry-server
19m Normal Started pod/redhat-operators-x2xq7 Started container registry-server
19m Warning Unhealthy pod/redhat-operators-x2xq7 Liveness probe failed: command timed out
19m Warning Unhealthy pod/redhat-operators-x2xq7 Readiness probe failed: command timed out
19m Normal Killing pod/redhat-operators-x2xq7 Container registry-server failed liveness probe, will be restarted
2h22m Warning Unhealthy pod/redhat-operators-x2xq7 Readiness probe errored: rpc error: code = Unknown desc = command error: cannot register an exec PID: container is stopping, stdout: , stderr: , exit code -1
2h22m Normal Pulled pod/redhat-operators-x2xq7 Successfully pulled image "quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:a2ed3a56ac3e411dffa5a6d960e8ab570b62cc00a560c485d3eb5c4eb34c9cc5" in 2.291s (2.291s including waiting). Image size: 911296197 bytes.
2h22m Warning Unhealthy pod/redhat-operators-x2xq7 Readiness probe errored: rpc error: code = NotFound desc = container is not created or running: checking if PID of e7d641a2428745ec023c30e6df52dbb05a54a638789c39daf80e76550b38a26f is running failed: container process not found
18m Warning Unhealthy pod/redhat-operators-x2xq7 Startup probe failed: command timed out
19m Normal Pulled pod/redhat-operators-x2xq7 Successfully pulled image "quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:a2ed3a56ac3e411dffa5a6d960e8ab570b62cc00a560c485d3eb5c4eb34c9cc5" in 2.632s (2.632s including waiting). Image size: 911296197 bytes.
Packageserver pod of openshfit-operator-lifecycle-manager namespace is also flooded with similar logs:
2026-02-26T05:59:37.438084601Z time="2026-02-26T05:59:37Z" level=info msg="updating PackageManifest based on CatalogSource changes: {certified-operators openshift-marketplace}" action="sync catalogsource" address="certified-operators.openshift-marketplace.svc:50051" name=certified-operators namespace=openshift-marketplace
2026-02-26T05:59:37.499731503Z time="2026-02-26T05:59:37Z" level=info msg="updating PackageManifest based on CatalogSource changes: {redhat-marketplace openshift-marketplace}" action="sync catalogsource" address="redhat-marketplace.openshift-marketplace.svc:50051" name=redhat-marketplace namespace=openshift-marketplace
2026-02-26T05:59:39.722580148Z time="2026-02-26T05:59:39Z" level=info msg="updating PackageManifest based on CatalogSource changes: {redhat-operators openshift-marketplace}" action="sync catalogsource" address="redhat-operators.openshift-marketplace.svc:50051" name=redhat-operators namespace=openshift-marketplace
2026-02-26T06:04:45.147891646Z time="2026-02-26T06:04:45Z" level=info msg="updating PackageManifest based on CatalogSource changes: {redhat-marketplace openshift-marketplace}" action="sync catalogsource" address="redhat-marketplace.openshift-marketplace.svc:50051" name=redhat-marketplace namespace=openshift-marketplace
2026-02-26T06:04:53.560728676Z time="2026-02-26T06:04:53Z" level=info msg="updating PackageManifest based on CatalogSource changes: {certified-operators openshift-marketplace}" action="sync catalogsource" address="certified-operators.openshift-marketplace.svc:50051" name=certified-operators namespace=openshift-marketplace
2026-02-26T06:06:58.467134497Z time="2026-02-26T06:06:58Z" level=info msg="updating PackageManifest based on CatalogSource changes: {community-operators openshift-marketplace}" action="sync catalogsource" address="community-operators.openshift-marketplace.svc:50051" name=community-operators namespace=openshift-marketplace
2026-02-26T06:11:09.323571467Z time="2026-02-26T06:11:09Z" level=info msg="updating PackageManifest based on CatalogSource changes: {redhat-operators openshift-marketplace}" action="sync catalogsource" address="redhat-operators.openshift-marketplace.svc:50051" name=redhat-operators namespace=openshift-marketplace
Version-Release number of selected component (if applicable):
How reproducible:
Not able to reproduce
Steps to Reproduce:
1.
2.
3.
Actual results:
Expected results:
Additional info:
- is duplicated by
-
OCPBUGS-78095 Redhat-operators pods of openshift-marketplace namespace are failing regularly due to startup probe timing out in a disconnected environment and packageserver pod is flooding with similar logs
-
- New
-