Hi everyone. I'm having trouble making a performance test work for the
Debezium connector. This test reads the events from a PostgreSQL database
produced by a number of operations (inserts, deletes, updates) done for ~20
min.
When running in DirectRunner the pipeline reads the messages, stops, and
outputs the messages that were read. The problem is that when it runs in
DataflowRunner, the pipeline doesn't stop and seems to be doing nothing,
since it's not making any progress or printing any helpful logs.

I know that DebeziumIO is still experimental so I'm not sure if it lacks
some feature that is causing it not to run properly in Dataflow or if it
needs some specific configuration.

Thank you all for your help.

Here's the PR and a Dataflow run:
- https://github.com/apache/beam/pull/22344
-
https://console.cloud.google.com/dataflow/jobs/us-west1/2022-12-07_12_51_09-13497028272697059516;bottomTab=JOB_LOGS;graphView=0;logsSeverity=ERROR?pageState=(%22dfTime%22:(%22s%22:%222022-12-07T20:51:09.921Z%22,%22e%22:%222022-12-07T21:55:08.936Z%22))&project=apache-beam-testing


- Mike Hernandez

-- 
*This email and its contents (including any attachments) are being sent to
you on the condition of confidentiality and may be protected by legal
privilege. Access to this email by anyone other than the intended recipient
is unauthorized. If you are not the intended recipient, please immediately
notify the sender by replying to this message and delete the material
immediately from your system. Any further use, dissemination, distribution
or reproduction of this email is strictly prohibited. Further, no
representation is made with respect to any content contained in this email.*

Reply via email to