  1. Debezium
  2. DBZ-7868

MongoDB connector works slowly with SHARDED mode


    • Type: Bug
    • Resolution: Cannot Reproduce
    • Priority: Major
    • None
    • 2.5.0.Final
    • mongodb-connector
    • None
    • False
    • None
    • False
    • Important

      In order to make your issue reports as actionable as possible, please provide the following information, depending on the issue type.

      Bug report

      For bug reports, provide this information, please:

      What Debezium connector do you use and what version?

      We currently use Debezium 2.5.0.Final.

      What is the connector configuration?

       

      {
        "connector.class": "io.debezium.connector.mongodb.MongoDbConnector",
        "topic.prefix": "test_mongo_debezium",
        "collection.include.list": "public.test_debezium",
        "transforms.RenameKeyField.type": "org.apache.kafka.connect.transforms.ReplaceField$Key",
        "topic.creation.default.partitions": "-1",
        "mongodb.connection.string": "mongodb://admin:admin@mongos.mongodb:27017/public?authSource=admin",
        "schema.name.adjustment.mode": "avro",
        "transforms": "RenameKeyField",
        "topic.creation.default.replication.factor": "-1",
        "name": "mongodb-source-connector",
        "transforms.RenameKeyField.renames": "id:_id"
      }

       

      What is the captured database version and mode of deployment?

      (E.g. on-premises, with a specific cloud provider, etc.)

      Mongo 7.0.5 (Local deployment), Kubernetes v1.28.4

      What behaviour do you expect?

      We expect it to work much faster, as it did with Debezium 2.3.0 and MongoDB 4.

      What behaviour do you see?

      We see a large lag between insert/update operations in MongoDB and the delivery of that data to Kafka. See the following timings:

      update in mongo   13:08:05 GMT
      put to topic      13:08:13.482Z
      write to postgres 16:08:13.924 +0300
      
      insert in mongo   13:09:12 GMT
      put to topic      13:09:19.089Z
      write to postgres 16:09:19.479 +0300  
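      To make the lag explicit, the timings above can be converted to UTC and subtracted. This is only a sketch: the calendar date is an assumption (the report gives times only), the names `events`, `parse`, and `lags` are made up for illustration, and the MongoDB timestamps have one-second precision, so the first figure is accurate to within about a second.

```python
from datetime import datetime, timezone

# Assumption: the report lists only times of day, so a date is filled in
# purely so the values parse. It does not affect the differences.
ASSUMED_DATE = "2024-05-13"

# Timestamps copied from the report, written with explicit UTC offsets.
events = {
    "update": {
        "mongo": f"{ASSUMED_DATE}T13:08:05+00:00",
        "topic": f"{ASSUMED_DATE}T13:08:13.482+00:00",
        "postgres": f"{ASSUMED_DATE}T16:08:13.924+03:00",
    },
    "insert": {
        "mongo": f"{ASSUMED_DATE}T13:09:12+00:00",
        "topic": f"{ASSUMED_DATE}T13:09:19.089+00:00",
        "postgres": f"{ASSUMED_DATE}T16:09:19.479+03:00",
    },
}


def parse(ts: str) -> datetime:
    """Parse an ISO 8601 timestamp and normalize it to UTC."""
    return datetime.fromisoformat(ts).astimezone(timezone.utc)


def lags(event: dict) -> tuple:
    """Return (MongoDB -> Kafka, Kafka -> PostgreSQL) lag in seconds."""
    mongo, topic, postgres = (parse(event[k]) for k in ("mongo", "topic", "postgres"))
    return (topic - mongo).total_seconds(), (postgres - topic).total_seconds()


for name, event in events.items():
    source_lag, sink_lag = lags(event)
    print(f"{name}: mongo->topic {source_lag:.3f}s, topic->postgres {sink_lag:.3f}s")
```

      With these numbers, the MongoDB to Kafka step takes roughly 7 to 8.5 seconds, while the Kafka to PostgreSQL step takes under half a second, so the delay is on the source connector side.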

      Do you see the same behaviour using the latest released Debezium version?

      (Ideally, also verify with latest Alpha/Beta/CR version)

      We haven't updated to the latest version yet.

      Do you have the connector logs, ideally from start till finish?

      (You might be asked later to provide DEBUG/TRACE level log)

      I've enabled TRACE logging for MongoDB, but I only see the following output, which repeats continuously.

       

      [2024-05-13T09:58:25.233][INFO][request_id= ][tenant_id= ][thread=DistributedHerder-connect-1-1][class=org.apache.kafka.connect.runtime.distributed.DistributedHerder][method=startWork] [Worker clientId=connect-1, groupId=streaming-service_streaming_service] Starting connectors and tasks using config offset 168
      [2024-05-13T09:58:25.233][INFO][request_id= ][tenant_id= ][thread=DistributedHerder-connect-1-1][class=org.apache.kafka.connect.runtime.distributed.DistributedHerder][method=startWork] [Worker clientId=connect-1, groupId=streaming-service_streaming_service] Finished starting connectors and tasks
      [2024-05-13T09:58:33.377][INFO][request_id= ][tenant_id= ][thread=qtp1184094435-70][class=org.apache.kafka.connect.runtime.rest.RestServer][method=write] 127.0.0.1 - - [13/May/2024:09:58:33 +0000] "GET / HTTP/1.1" 200 91 "-" "curl/8.5.0" 11
      [2024-05-13T09:58:35.051][INFO][request_id= ][tenant_id= ][thread=debezium-mongodbconnector-test_mongo_debezium-replica-set-monitor][class=org.mongodb.driver.client][method=info][mongodb-source-connector|worker]  MongoClient with metadata {"driver": {"name": "mongo-java-driver|sync", "version": "4.11.0"}, "os": {"type": "Linux", "name": "Linux", "architecture": "amd64", "version": "5.15.0-25-generic"}, "platform": "Java/Eclipse Adoptium/17.0.7+7"} created with settings MongoClientSettings{readPreference=primary, writeConcern=WriteConcern{w=null, wTimeout=null ms, journal=null}, retryWrites=true, retryReads=true, readConcern=ReadConcern{level=null}, credential=MongoCredential{mechanism=null, userName='root', source='admin', password=<hidden>, mechanismProperties=<hidden>}, transportSettings=null, streamFactoryFactory=null, commandListeners=[], codecRegistry=ProvidersCodecRegistry{codecProviders=[ValueCodecProvider{}, BsonValueCodecProvider{}, DBRefCodecProvider{}, DBObjectCodecProvider{}, DocumentCodecProvider{}, CollectionCodecProvider{}, IterableCodecProvider{}, MapCodecProvider{}, GeoJsonCodecProvider{}, GridFSFileCodecProvider{}, Jsr310CodecProvider{}, JsonObjectCodecProvider{}, BsonCodecProvider{}, EnumCodecProvider{}, com.mongodb.client.model.mql.ExpressionCodecProvider@11c98627, com.mongodb.Jep395RecordCodecProvider@5adf69bf, com.mongodb.KotlinCodecProvider@454dece4]}, loggerSettings=LoggerSettings{maxDocumentLength=1000}, clusterSettings={hosts=[mongos.mongodb:27017], srvServiceName=mongodb, mode=SINGLE, requiredClusterType=UNKNOWN, requiredReplicaSetName='null', serverSelector='null', clusterListeners='[]', serverSelectionTimeout='30000 ms', localThreshold='15 ms'}, socketSettings=SocketSettings{connectTimeoutMS=10000, readTimeoutMS=0, receiveBufferSize=0, proxySettings=ProxySettings{host=null, port=null, username=null, password=null}}, heartbeatSocketSettings=SocketSettings{connectTimeoutMS=10000, readTimeoutMS=10000, receiveBufferSize=0, 
proxySettings=ProxySettings{host=null, port=null, username=null, password=null}}, connectionPoolSettings=ConnectionPoolSettings{maxSize=100, minSize=0, maxWaitTimeMS=120000, maxConnectionLifeTimeMS=0, maxConnectionIdleTimeMS=0, maintenanceInitialDelayMS=0, maintenanceFrequencyMS=60000, connectionPoolListeners=[], maxConnecting=2}, serverSettings=ServerSettings{heartbeatFrequencyMS=10000, minHeartbeatFrequencyMS=500, serverListeners='[]', serverMonitorListeners='[]'}, sslSettings=SslSettings{enabled=false, invalidHostNameAllowed=false, context=null}, applicationName='null', compressorList=[], uuidRepresentation=UNSPECIFIED, serverApi=null, autoEncryptionSettings=null, dnsClient=null, inetAddressResolver=null, contextProvider=null}
      [2024-05-13T09:58:35.051][INFO][request_id= ][tenant_id= ][thread=debezium-mongodbconnector-test_mongo_debezium-replica-set-monitor][class=org.mongodb.driver.cluster][method=info][mongodb-source-connector|worker]  No server chosen by ReadPreferenceServerSelector{readPreference=primary} from cluster description ClusterDescription{type=UNKNOWN, connectionMode=SINGLE, serverDescriptions=[ServerDescription{address=mongos.mongodb:27017, type=UNKNOWN, state=CONNECTING}]}. Waiting for 30000 ms before timing out
      [2024-05-13T09:58:35.053][INFO][request_id= ][tenant_id= ][thread=cluster-ClusterId{value='6641e44b84e98d0a6e36b165', description='null'}-mongos.mongodb:27017][class=org.mongodb.driver.cluster][method=info][mongodb-source-connector|worker]  Monitor thread successfully connected to server with description ServerDescription{address=mongos.mongodb:27017, type=SHARD_ROUTER, state=CONNECTED, ok=true, minWireVersion=0, maxWireVersion=21, maxDocumentSize=16777216, logicalSessionTimeoutMinutes=30, roundTripTimeNanos=2013844}
      [2024-05-13T09:59:03.632][INFO][request_id= ][tenant_id= ][thread=qtp1184094435-66][class=org.apache.kafka.connect.runtime.rest.RestServer][method=write] 127.0.0.1 - - [13/May/2024:09:59:03 +0000] "GET / HTTP/1.1" 200 91 "-" "curl/8.5.0" 84
      [2024-05-13T09:59:05.051][INFO][request_id= ][tenant_id= ][thread=debezium-mongodbconnector-test_mongo_debezium-replica-set-monitor][class=org.mongodb.driver.client][method=info][mongodb-source-connector|worker]  MongoClient with metadata {"driver": {"name": "mongo-java-driver|sync", "version": "4.11.0"}, "os": {"type": "Linux", "name": "Linux", "architecture": "amd64", "version": "5.15.0-25-generic"}, "platform": "Java/Eclipse Adoptium/17.0.7+7"} created with settings MongoClientSettings{readPreference=primary, writeConcern=WriteConcern{w=null, wTimeout=null ms, journal=null}, retryWrites=true, retryReads=true, readConcern=ReadConcern{level=null}, credential=MongoCredential{mechanism=null, userName='root', source='admin', password=<hidden>, mechanismProperties=<hidden>}, transportSettings=null, streamFactoryFactory=null, commandListeners=[], codecRegistry=ProvidersCodecRegistry{codecProviders=[ValueCodecProvider{}, BsonValueCodecProvider{}, DBRefCodecProvider{}, DBObjectCodecProvider{}, DocumentCodecProvider{}, CollectionCodecProvider{}, IterableCodecProvider{}, MapCodecProvider{}, GeoJsonCodecProvider{}, GridFSFileCodecProvider{}, Jsr310CodecProvider{}, JsonObjectCodecProvider{}, BsonCodecProvider{}, EnumCodecProvider{}, com.mongodb.client.model.mql.ExpressionCodecProvider@11c98627, com.mongodb.Jep395RecordCodecProvider@5adf69bf, com.mongodb.KotlinCodecProvider@454dece4]}, loggerSettings=LoggerSettings{maxDocumentLength=1000}, clusterSettings={hosts=[mongos.mongodb:27017], srvServiceName=mongodb, mode=SINGLE, requiredClusterType=UNKNOWN, requiredReplicaSetName='null', serverSelector='null', clusterListeners='[]', serverSelectionTimeout='30000 ms', localThreshold='15 ms'}, socketSettings=SocketSettings{connectTimeoutMS=10000, readTimeoutMS=0, receiveBufferSize=0, proxySettings=ProxySettings{host=null, port=null, username=null, password=null}}, heartbeatSocketSettings=SocketSettings{connectTimeoutMS=10000, readTimeoutMS=10000, receiveBufferSize=0, 
proxySettings=ProxySettings{host=null, port=null, username=null, password=null}}, connectionPoolSettings=ConnectionPoolSettings{maxSize=100, minSize=0, maxWaitTimeMS=120000, maxConnectionLifeTimeMS=0, maxConnectionIdleTimeMS=0, maintenanceInitialDelayMS=0, maintenanceFrequencyMS=60000, connectionPoolListeners=[], maxConnecting=2}, serverSettings=ServerSettings{heartbeatFrequencyMS=10000, minHeartbeatFrequencyMS=500, serverListeners='[]', serverMonitorListeners='[]'}, sslSettings=SslSettings{enabled=false, invalidHostNameAllowed=false, context=null}, applicationName='null', compressorList=[], uuidRepresentation=UNSPECIFIED, serverApi=null, autoEncryptionSettings=null, dnsClient=null, inetAddressResolver=null, contextProvider=null}
      [2024-05-13T09:59:05.051][INFO][request_id= ][tenant_id= ][thread=debezium-mongodbconnector-test_mongo_debezium-replica-set-monitor][class=org.mongodb.driver.cluster][method=info][mongodb-source-connector|worker]  No server chosen by ReadPreferenceServerSelector{readPreference=primary} from cluster description ClusterDescription{type=UNKNOWN, connectionMode=SINGLE, serverDescriptions=[ServerDescription{address=mongos.mongodb:27017, type=UNKNOWN, state=CONNECTING}]}. Waiting for 30000 ms before timing out
      [2024-05-13T09:59:05.056][INFO][request_id= ][tenant_id= ][thread=cluster-ClusterId{value='6641e46984e98d0a6e36b166', description='null'}-mongos.mongodb:27017][class=org.mongodb.driver.cluster][method=info][mongodb-source-connector|worker]  Monitor thread successfully connected to server with description ServerDescription{address=mongos.mongodb:27017, type=SHARD_ROUTER, state=CONNECTED, ok=true, minWireVersion=0, maxWireVersion=21, maxDocumentSize=16777216, logicalSessionTimeoutMinutes=30, roundTripTimeNanos=3063624}
      
      
      I suspect the main problem is that, for some reason, the driver does not know what kind of server it is connecting to: No server chosen by ReadPreferenceServerSelector{readPreference=primary} from cluster description ClusterDescription{type=UNKNOWN, connectionMode=SINGLE, serverDescriptions=[ServerDescription{address=mongos.mongodb:27017, type=UNKNOWN, state=CONNECTING}]}. Waiting for 30000 ms before timing out
      
      
      Meanwhile, the monitor thread connects successfully and reports type=SHARD_ROUTER:
      Monitor thread successfully connected to server with description ServerDescription{address=mongos.mongodb:27017, type=SHARD_ROUTER, state=CONNECTED, ok=true, minWireVersion=0, maxWireVersion=21, maxDocumentSize=16777216, logicalSessionTimeoutMinutes=30, roundTripTimeNanos=165267 
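      The difference between the two driver messages can be pulled out mechanically when scanning a long log. The following is a rough sketch that only parses the log text quoted above. The helper name `server_status` and the regular expression are illustrative assumptions, not part of Debezium or the MongoDB driver, and the sample lines are shortened copies of the messages in this report.

```python
import re

# Matches the address, type, and state fields of a driver ServerDescription.
SERVER_DESCRIPTION_RE = re.compile(
    r"ServerDescription\{address=(?P<address>[^,]+), "
    r"type=(?P<type>\w+), state=(?P<state>\w+)"
)


def server_status(line: str):
    """Return address/type/state from a driver log line, or None if absent."""
    match = SERVER_DESCRIPTION_RE.search(line)
    return match.groupdict() if match else None


# Shortened copies of the two messages quoted in this report.
selector_line = (
    "No server chosen by ReadPreferenceServerSelector{readPreference=primary} "
    "from cluster description ClusterDescription{type=UNKNOWN, connectionMode=SINGLE, "
    "serverDescriptions=[ServerDescription{address=mongos.mongodb:27017, "
    "type=UNKNOWN, state=CONNECTING}]}. Waiting for 30000 ms before timing out"
)
monitor_line = (
    "Monitor thread successfully connected to server with description "
    "ServerDescription{address=mongos.mongodb:27017, type=SHARD_ROUTER, "
    "state=CONNECTED, ok=true"
)

for label, line in (("selector", selector_line), ("monitor", monitor_line)):
    print(label, server_status(line))
```

      Running this over the full connector log would show how often the server is reported as UNKNOWN/CONNECTING versus SHARD_ROUTER/CONNECTED.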

       

       

      How to reproduce the issue using our tutorial deployment?

      Deploy Kafka, create a MongoDB connector, and try to stream data from MongoDB 7 (we use version 7.0.5) to Kafka.

      Note: This does not occur in REPLICA_SET mode; the issue appears only in SHARDED mode.

      For some reason, the Java driver hits an exception while connecting to MongoDB.

            jcechace@redhat.com Jakub Čecháček
            pavelyadrov Pavel Yadrov (Inactive)