Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-10211

PTPOperator config transportHost with AMQ is not detected

XMLWordPrintable

    • Critical
    • No
    • Proposed
    • False
    • Hide

      None

      Show
      None

      This is a clone of issue OCPBUGS-7823. The following is the description of the original issue:

      This is a clone of issue OCPBUGS-7728. The following is the description of the original issue:

      This is a clone of issue OCPBUGS-7611. The following is the description of the original issue:

      Description of problem:

      ptpoperator not finding valid amq router also Changing ptpoperatorconfig to update amq transport host , ignores AMQ check since the previous AMQ check had failed.

      FIX needed along with proper detection of amq-router : For every update on ptpoperatorconfig , the check flag should reset and treat it as new deployment.

      deploy ptp operator with HTTP or with invalid amq . Switching back to AMQ with valid service name and namespace is not detecting AMQ as installed and running .Also during update of ptpoperatorconfig , time.Sleep(retryTimeout) delays any further reconcile. unless 3 minutes are passed

      Version-Release number of selected component (if applicable):

      4.13

      How reproducible:

      oc apply -n openshift-ptp
      ----------------------------------
      apiVersion: ptp.openshift.io/v1
      kind: PtpOperatorConfig
      metadata:
        name: default
      spec:
        daemonNodeSelector: {}
        ptpEventConfig:
          enableEventPublisher: true
          transportHost: "amqp://amq-router.amq-router.svc.cluster.local"
      

       

       

      Steps to Reproduce:

      1.Install amq router via operator( Manual installation won't detect AMQ since the check method depends the service Account  named as service name) 2.Apply ptp config and wait for the daemonset to deploy 
      3. Apply amq transport host and watch ptpoperator logs 
      4.The reconcile get stuck for 3 minutes trying to retry on AMQ instance check
      3.
      

      Actual results:

      AMQ instance not detected  

      Expected results:

      AMQ instance detected 

      Additional info:

      1. The waiting for 3 minutes is causing other reconcile loop to stop
      should be a thread with channel so when you see reapply ptpoperatorconfig the old retry gets cancelled also able to reconcile ptpconfig changes
      2. I have the service running but it is not detecting
      oc get service -n amq-router
      NAME         TYPE        CLUSTER-IP     EXTERNAL-IP   PORT(S)    AGE
      amq-router   ClusterIP   172.30.54.49   <none>        5672/TCP   5m10s

      1 ptpoperatorconfig_controller.go:205] check AMQP service amq-router in namespace amq-router before deploying linuxptp daemon I0216 16:01:44.058600       1 ptpoperatorconfig_controller.go:325] AMQP service amq-router in namespace amq-router is NOT running, retry every 500ms for 1s

              aputtur@redhat.com Aneesh Puttur
              openshift-crt-jira-prow OpenShift Prow Bot
              Ofer Bochan Ofer Bochan
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: