Loading...

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 10.0.0.Final
Affects Version/s: 10.0.0.Beta1
Component/s: Batch
Labels:
None

Steps to Reproduce:
Hide

1. start a domain from the default configuration
2. for main-server-group deploy a JDBC driver for oracle, db2 or sybase and create a data source in full profile using the driver
3.

/profile=full/subsystem=batch-jberet/jdbc-job-repository=jdbc:add(data-source=$DS_NAME)

4. server-one and server-two race to create the tables, resulting in:

{ "outcome" => "failed", "result" => undefined, "failure-description" => {"WFLYDC0074: Operation failed or was rolled back on all servers. Server failures:" => {"server-group" => {"main-server-group" => {"host" => {"master" => {"server-two" => {"WFLYCTL0080: Failed services" => {"org.wildfy.batch.job.repository.jdbc" => "org.jboss.msc.service.StartException in service org.wildfy.batch.job.repository.jdbc: WFLYBATCH000011: Failed to create JDBC job repository. Caused by: javax.batch.operations.BatchRuntimeException: JBERET000624: Failed to create tables for batch job repository database product name Adaptive Server Enterprise with DDL sql/jberet-sybase.ddl Caused by: java.sql.BatchUpdateException: JZ0BE: BatchUpdateException: Error occurred while executing batch statement: There is already an object named 'JOB_INSTANCE' in the database. "}}}}}}}}, "rolled-back" => true, "server-groups" => {"main-server-group" => {"host" => {"master" => { "server-one" => {"response" => { "outcome" => "failed", "rolled-back" => true }}, "server-two" => {"response" => { "outcome" => "failed", "failure-description" => {"WFLYCTL0080: Failed services" => {"org.wildfy.batch.job.repository.jdbc" => "org.jboss.msc.service.StartException in service org.wildfy.batch.job.repository.jdbc: WFLYBATCH000011: Failed to create JDBC job repository. Caused by: javax.batch.operations.BatchRuntimeException: JBERET000624: Failed to create tables for batch job repository database product name Adaptive Server Enterprise with DDL sql/jberet-sybase.ddl Caused by: java.sql.BatchUpdateException: JZ0BE: BatchUpdateException: Error occurred while executing batch statement: There is already an object named 'JOB_INSTANCE' in the database. "}}, "rolled-back" => true }} }}}} }
Show
1. start a domain from the default configuration 2. for main-server-group deploy a JDBC driver for oracle, db2 or sybase and create a data source in full profile using the driver 3. /profile=full/subsystem=batch-jberet/jdbc-job-repository=jdbc:add(data-source=$DS_NAME) 4. server-one and server-two race to create the tables, resulting in: { "outcome" => "failed", "result" => undefined, "failure-description" => {"WFLYDC0074: Operation failed or was rolled back on all servers. Server failures:" => {"server-group" => {"main-server-group" => {"host" => {"master" => {"server-two" => {"WFLYCTL0080: Failed services" => {"org.wildfy.batch.job.repository.jdbc" => "org.jboss.msc.service.StartException in service org.wildfy.batch.job.repository.jdbc: WFLYBATCH000011: Failed to create JDBC job repository. Caused by: javax.batch.operations.BatchRuntimeException: JBERET000624: Failed to create tables for batch job repository database product name Adaptive Server Enterprise with DDL sql/jberet-sybase.ddl Caused by: java.sql.BatchUpdateException: JZ0BE: BatchUpdateException: Error occurred while executing batch statement: There is already an object named 'JOB_INSTANCE' in the database. "}}}}}}}}, "rolled-back" => true, "server-groups" => {"main-server-group" => {"host" => {"master" => { "server-one" => {"response" => { "outcome" => "failed", "rolled-back" => true }}, "server-two" => {"response" => { "outcome" => "failed", "failure-description" => {"WFLYCTL0080: Failed services" => {"org.wildfy.batch.job.repository.jdbc" => "org.jboss.msc.service.StartException in service org.wildfy.batch.job.repository.jdbc: WFLYBATCH000011: Failed to create JDBC job repository. Caused by: javax.batch.operations.BatchRuntimeException: JBERET000624: Failed to create tables for batch job repository database product name Adaptive Server Enterprise with DDL sql/jberet-sybase.ddl Caused by: java.sql.BatchUpdateException: JZ0BE: BatchUpdateException: Error occurred while executing batch statement: There is already an object named 'JOB_INSTANCE' in the database. "}}, "rolled-back" => true }} }}}} }
Workaround:

Workaround Exists
Workaround Description:

Hide

Because one node will (always, I think) succeed to create all tables and the rollback of the management operation doesn't rollback the database changes, issuing the command in step 3 for the second time will succeed, because all tables already exist this time.

Show
Because one node will (always, I think) succeed to create all tables and the rollback of the management operation doesn't rollback the database changes, issuing the command in step 3 for the second time will succeed, because all tables already exist this time.

When multiple servers (eg. a server-group in a JBoss domain) attempt to create a JDBC Batch repository at once, and they share a common database, they race in creating the database tables, because sometimes instance A will create the table PARTITION_EXECUTION after instance B invokes the check whether this table exists (this check is used to decide whether all tables need to be created) - therefore B will decide to create all tables too.
If the database doesn't support the IF NOT EXISTS clause in CREATE TABLE (db2, oracle, sybase..), it will probably lead to a failure like 'table already exists', which in turn rolls back the creation of the JDBC store on all nodes in the domain, including those where the schema creation succeeded or was skipped. The tables remain physically in the database.

This is similar to ~~WFLY-5061~~, but this time, it is caused by having multiple servers in a domain.

Sharing a common database in domain mode should work because it is useful for restarting failed jobs on different nodes if one node crashes.

clones

JBEAP-758 Error creating Batch JDBC storage in domain

Closed

is blocked by

WFLY-5174 Upgrade jberet-core to 1.2.0.Beta1

Closed

is related to

JBERET-185 Add test to verify race condition when concurrent clients creating jdbc repository

Resolved

relates to

WFLY-5061 Tables for JDBC Batch repository are created twice

Closed

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates