Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Done
Priority: Major
Fix Version/s: 0.3.6, 0.4
Affects Version/s: 0.3.4
Component/s: mysql-connector
Labels:
None

Git Pull Request:
https://github.com/debezium/debezium/pull/157

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

We are testing Debezium on some fairly large tables (a couple of hundred million rows), and our production databases are even bigger. We are noticing that Debezium seems to hang for quite some time before it starts the actual snapshot of each table.

After a couple of threaddumps the hang seems to be caused by the SELECT COUNT(*) FROM <table> in io.debezium.connector.mysql.SnapshotReader.execute. This kind of query can be very slow for large InnoDB tables.

It would be great to have a configuration option to always use the streaming resultset (and skip the select count query), or optimize this to get an approximate table size faster.

For example, MySQL has a `show table status like <tableName>` that returns an approximate row count, perhaps that would be good enough for this use case.

Assignee:: Randall Hauch (Inactive)

Reporter:: Dennis Persson (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2016/11/14 6:33 AM

Updated:: 2017/08/17 9:07 AM

Resolved:: 2016/12/20 6:52 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates