Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: infra-operator
Labels:
None

Epic Link:
Eventing for Operators
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Dev Approval:
?
Docs Approval:
?
PM Approval:
?
QE Approval:
?
Regression:
None
Intelligence Requested:
Market:

Severity:
Moderate

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

I created a similar ticket already for Telemetry, it's actually the same request for all components, but just to illustrate my point:

[root@jumphost 05_control_plane]# oc get openstackcontrolplane
NAME                      STATUS   MESSAGE
openstack-control-plane   False    OpenStackControlPlane DNS in progress
[root@jumphost 05_control_plane]# oc get events
LAST SEEN   TYPE     REASON    OBJECT             MESSAGE
6m22s       Normal   Killing   pod/ceilometer-0   Stopping container ceilometer-central-agent
6m22s       Normal   Killing   pod/ceilometer-0   Stopping container sg-core
6m22s       Normal   Killing   pod/ceilometer-0   Stopping container ceilometer-notification-agent
6m22s       Normal   Killing   pod/ceilometer-0   Stopping container proxy-httpd
[root@jumphost 05_control_plane]#

# oc describe openstackcontrolplane | less
(...)
  Dns:
    Enabled:  true
    Template:
      Container Image:                registry.redhat.io/rhoso/openstack-neutron-server-rhel9@sha256:3c49822b33a4d9b05ee9946ca92923d6697c7c66787a02c69a8420ba2de94778
      Dns Data Label Selector Value:  dnsdata
      Options:
        Key:  server
        Values:
          8.8.8.8
        Key:  server
        Values:
          8.8.4.4
      Override:
        Service:
          Metadata:
            Annotations:
              metallb.universe.tf/address-pool:     ctlplane
              metallb.universe.tf/allow-shared-ip:  ctlplane
              metallb.universe.tf/loadBalancerIPs:  172.20.1.80
          Spec:
            Type:  LoadBalancer
      Replicas:    2
(...)
    Message:               OpenStackControlPlane DNS in progress
    Reason:                Requested
    Severity:              Info
    Status:                False
    Type:                  Ready
    Last Transition Time:  2024-09-08T16:34:34Z
(...)

I mentioned in the other ticket that it's an anti-pattern in OpenShift to log issues into pod errors only, and here's why:
I as a user of the OpenStack operator have no idea which one of the many operator-pods is actually responsible for driving forward the DNS progress. So I have to do something as absurd as:

# oc get pods -n openstack-operators -o name | while read p; do echo "=== $p ==="; oc logs -n openstack-operators $p | grep -i dns; done | less

.. in order to figure out that the the infra-operator-controller-manager is responsible for this. Please make sure to update the OpenStackControlPlane CR with the actual status and failures, and please emit events as well

Thanks!

Assignee:: Unassigned

Reporter:: Andreas Karis

Team:: rhos-dfg-ospk8s

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2024/09/08 4:58 PM

Updated:: 2024/10/23 1:16 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty