
DBZ-2754: Oracle connector (using LogMiner) with Oracle RDS (v12) does not capture changes

I am trying to test the Oracle connector against the AWS RDS version of Oracle (v12).

Everything works fine while Debezium takes the initial snapshot of the table, but whenever new records are inserted afterwards, the changes never stream to the Kafka topic. After a fairly long delay (roughly 20 minutes), the following shows up in the logs:

      2020-11-12 03:30:18,713 WARN   ||  All transactions with first SCN <= 577584 will be abandoned, offset: 576745   [io.debezium.connector.oracle.logminer.LogMinerHelper]
      2020-11-12 03:30:18,713 WARN   ||  Following long running transaction 09000F0095010000 will be abandoned and ignored: Transaction{firstScn=576785, lastScn=576785, redoSqls=[[insert into "ADMIN"."CUSTOMERS"("ID","FIRST_NAME","SALARY") values ('4','tammy','900');]]}    [io.debezium.connector.oracle.logminer.LogMinerHelper]
      2020-11-12 03:30:18,714 WARN   ||  Following long running transaction 0800210083010000 will be abandoned and ignored: Transaction{firstScn=576748, lastScn=576748, redoSqls=[[insert into "ADMIN"."CUSTOMERS"("ID","FIRST_NAME","SALARY") values ('3','timmo','600');]]}    [io.debezium.connector.oracle.logminer.LogMinerHelper]
      2020-11-12 03:30:18,714 WARN   ||  Following long running transaction 09001B0094010000 will be abandoned and ignored: Transaction{firstScn=576826, lastScn=576826, redoSqls=[[insert into "ADMIN"."CUSTOMERS"("ID","FIRST_NAME","SALARY") values ('5','cardi b','1000');]]}    [io.debezium.connector.oracle.logminer.LogMinerHelper]
      

I am not sure whether this is linked to the fact that the Oracle connector requires the database to have two log files per redo log group, because that is not possible in RDS Oracle.
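Worth noting for anyone triaging this: the abandoned transactions in the log above contain only INSERT statements and no COMMIT. The LogMiner-based connector buffers a transaction's changes and emits them to Kafka only once the commit is mined, so inserts issued from a client with auto-commit disabled (for example, sqlplus by default) stay buffered until they are eventually abandoned. A minimal sanity check against the table from the logs:

      -- Insert and commit in the same session; the connector emits events
      -- for a transaction only after its COMMIT appears in the redo stream.
      INSERT INTO "ADMIN"."CUSTOMERS" ("ID", "FIRST_NAME", "SALARY")
      VALUES ('6', 'test', '1200');
      COMMIT;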


Gunnar Morling added a comment:

The transaction buffer size can now be adjusted, so the maximum duration of supported transactions is a function of how much memory you can afford for that buffer. Via DBZ-3123 we're planning to explore a more scalable solution, e.g. using off-heap memory or a distributed K/V store.

ccranfor@redhat.com, in the meantime: is there a very clear ERROR message in the logs in case events from a long-running transaction are discarded due to this?
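For reference, the scalable-buffer work mentioned via DBZ-3123 later surfaced as a pluggable transaction buffer in the connector configuration. A sketch of the relevant fragment, assuming property names from later Debezium releases (treat them as illustrative for the version discussed in this issue):

      {
        "connector.class": "io.debezium.connector.oracle.OracleConnector",
        "log.mining.buffer.type": "memory"
      }

With "memory", the buffer lives on the JVM heap, so the longest transaction the connector can follow is bounded by the heap given to Kafka Connect; the Infinispan-backed buffer types move that state off-heap.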

Chris Cranford added a comment:

I took a deeper look at the warnings; these are actually normal.

When the connector detects a LOGSWITCH, a certain amount of bookkeeping has to happen in the connector's internal buffers. One of those operations is to check whether there are any long-running transactions and decide how to handle them. Any long-running transaction prevents the connector from flushing the current mining start SCN to the offsets. This means that if the connector is restarted or stops unexpectedly, it resumes from the SCN in the offsets and re-emits many events, because it must reprocess those long-running transactions for which no commit has been seen yet.

The connector uses a fixed four-hour window for maintaining long-running transactions, which may be too small for certain environments and should be configurable. I'll look at adding this as a configuration option so that, when a LOGSWITCH is detected and you retain redo and archive logs for a longer period of time, you can tune the setting to your needs.

That said, I'm curious whether this might be related to a recent bug where not all COMMIT operations were being analyzed, which caused some transactions that used auto-commit mode to be abandoned and not processed correctly. I would strongly suggest you upgrade to 1.4.1.Final as a baseline and see if this persists. In the meantime I'm going to proceed with making this fixed four-hour window configurable either way, since everyone's environment is different.
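The configurable window described here shipped in later Debezium releases as a connector property; a sketch of the relevant configuration fragment, assuming the property name from those releases and an environment that retains archive logs for at least a day:

      {
        "connector.class": "io.debezium.connector.oracle.OracleConnector",
        "log.mining.transaction.retention.hours": "24"
      }

Transactions older than this window relative to the current mining position are abandoned, so the value should stay within your archive log retention.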

Chris Cranford added a comment:

mngeow, can you share the steps you took to configure the RDS Oracle environment, the LogMiner user, and your connector's full configuration?
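For anyone else setting this up, the database-side prerequisites on RDS have to go through Amazon's rdsadmin wrappers, since ALTER DATABASE is not available to the master user. A sketch of the usual steps (the retention value is illustrative; adjust it to your log volume):

      -- Retain archived redo logs long enough for LogMiner to mine them
      EXEC rdsadmin.rdsadmin_util.set_configuration('archivelog retention hours', 24);
      -- Enable force logging and minimal supplemental logging database-wide
      EXEC rdsadmin.rdsadmin_util.force_logging(p_enable => true);
      EXEC rdsadmin.rdsadmin_util.alter_supplemental_logging(p_action => 'ADD');
      -- Add (ALL) column supplemental logging per captured table
      ALTER TABLE "ADMIN"."CUSTOMERS" ADD SUPPLEMENTAL LOG DATA (ALL) COLUMNS;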

Marcus ngeow added a comment (edited):

Yeah, this is the same account that is used by the connector.

Not sure if this helps, but this is taken from the AWS documentation:

"When you create a DB instance, the master user account that you use to create the instance gets DBA privileges (with some limitations). Use the master user account for any administrative tasks such as creating additional user accounts in the database. You can't use the SYS user, SYSTEM user, and other Oracle-supplied administrative accounts."

I was doing everything with the master user account.
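Since SYS, SYSTEM, and the other Oracle-supplied accounts are off limits on RDS, an alternative to running everything as the master user is a dedicated capture user provisioned by the master account. A hedged sketch of the kind of grants involved (the exact list depends on the connector version; SYS-owned objects must be granted through the RDS wrapper):

      -- Hypothetical dedicated capture user; choose your own credentials
      CREATE USER dbzuser IDENTIFIED BY dbz;
      GRANT CREATE SESSION TO dbzuser;
      GRANT SELECT ANY TRANSACTION TO dbzuser;
      GRANT LOGMINING TO dbzuser;
      GRANT SELECT ON "ADMIN"."CUSTOMERS" TO dbzuser;
      -- SYS-owned packages and views go through rdsadmin
      EXEC rdsadmin.rdsadmin_util.grant_sys_object('DBMS_LOGMNR', 'DBZUSER', 'EXECUTE');
      EXEC rdsadmin.rdsadmin_util.grant_sys_object('V_$LOGMNR_CONTENTS', 'DBZUSER', 'SELECT');
      EXEC rdsadmin.rdsadmin_util.grant_sys_object('V_$DATABASE', 'DBZUSER', 'SELECT');
      EXEC rdsadmin.rdsadmin_util.grant_sys_object('V_$ARCHIVED_LOG', 'DBZUSER', 'SELECT');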

Chris Cranford added a comment:

Is this admin account the same account used by the connector?

Marcus ngeow added a comment:

ccranfor@redhat.com, the transactions were performed using the default admin account that you get when creating an RDS instance on AWS.

Chris Cranford added a comment:

mngeow, can you confirm whether the transactions that are being abandoned are performed by SYS, SYSTEM, or the LogMiner user account?
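One way to answer this directly is to mine the reported SCN range by hand and inspect the USERNAME column for the abandoned XIDs from the log above. A sketch, assuming the session can execute DBMS_LOGMNR and query V_$LOGMNR_CONTENTS, and that the logs covering that range are still available:

      -- Mine the SCN range reported by the connector
      BEGIN
        DBMS_LOGMNR.START_LOGMNR(
          STARTSCN => 576745,
          ENDSCN   => 577584,
          OPTIONS  => DBMS_LOGMNR.DICT_FROM_ONLINE_CATALOG
                    + DBMS_LOGMNR.CONTINUOUS_MINE);
      END;
      /
      -- Which user performed one of the abandoned transactions?
      SELECT username, operation, sql_redo
        FROM v$logmnr_contents
       WHERE xid = HEXTORAW('09000F0095010000');

      EXEC DBMS_LOGMNR.END_LOGMNR;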

René Rütter added a comment:

Hi, my DBZ-2759 is about much the same issue.

Assignee: Chris Cranford (ccranfor@redhat.com)
Reporter: Marcus ngeow (mngeow)
Votes: 0
Watchers: 4