Uploaded image for project: 'Red Hat Data Grid'
  1. Red Hat Data Grid
  2. JDG-6462

[Operator] Cluster is not being formed when cross site configured

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Blocker Blocker
    • RHDG 8.4.4 GA
    • RHDG 8.4.x CD
    • Operator
    • None
    • False
    • None
    • False

      Cluster is not being formed when cross site is configured (Cross-site itself is formed on both nodes). Server logs of one cluster are attached. **

      Weird part is status message for WellFormed condition:

      status: 
        conditions: 
          - status: 'True'
            type: PreliminaryChecksPassed
          - status: 'True'
            type: GossipRouterReady
          - message: >-
              Views: 
              test-cross-site-view-internal-multi-pod-xsite1-0-23266,test-cross-site-view-internal-multi-pod-xsite1-1-19706
            status: 'False'
            type: WellFormed
      

      OpenShift: ** 4.13 on AWS

      Reproducers:

      • TestCrossSiteViewInternalMultiPod
      • TestDefaultTLSInternalMultiPod

      JGroups config:

      <jgroups>
          <stack name="image-tcp" extends="tcp">
              <TCP bind_addr="${jgroups.bind.address:SITE_LOCAL}"
                   bind_port="${jgroups.bind.port,jgroups.tcp.port:7800}"
                   diag.enabled="false"
                   port_range="0"
              />
              <dns.DNS_PING dns_query="test-cross-site-view-internal-multi-pod-xsite1-ping.datagrid-xsite2.svc.cluster.local"
                            dns_record_type="A"
                            stack.combine="REPLACE" stack.position="MPING"/>
          </stack>
           
          <stack name="relay-tunnel" extends="udp">
              <TUNNEL
                  bind_addr="${jgroups.relay.bind.address:SITE_LOCAL}"
                  bind_port="${jgroups.relay.bind.port:0}"
                  gossip_router_hosts="test-cross-site-view-internal-multi-pod-xsite1-site[7900],test-cross-site-view-internal-multi-pod-xsite2-site.datagrid-xsite2.svc.cluster.local[7900]"
                  diag.enabled="false"
                  port_range="0"
                  
                  stack.combine="REPLACE"
                  stack.position="UDP"
              />
              <!-- we are unable to use FD_SOCK with openshift -->
              <!-- otherwise, we would need 1 external service per pod -->
              <FD_SOCK2 stack.combine="REMOVE"/>
          </stack>
          <stack name="xsite" extends="image-tcp">
              <relay.RELAY2 xmlns="urn:org:jgroups" site="xsite1" max_site_masters="2" />
              <remote-sites default-stack="relay-tunnel">
                  <remote-site name="xsite1"/>
              
                  <remote-site name="xsite2"/>
              </remote-sites>
          </stack>
      </jgroups>
      

        1. server0.log
          6 kB
          Pavel Drobek
        2. server1.log
          6 kB
          Pavel Drobek

              pruivo@redhat.com Pedro Ruivo
              pdrobek@redhat.com Pavel Drobek (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved: