Pablo Estrada created BEAM-14161:
------------------------------------

             Summary: Add dynamic splitting to JdbcIO.readWithPartitions
                 Key: BEAM-14161
                 URL: https://issues.apache.org/jira/browse/BEAM-14161
             Project: Beam
          Issue Type: Improvement
          Components: io-java-jdbc
            Reporter: Pablo Estrada
            Assignee: Jean-Baptiste Onofré
             Fix For: Not applicable


Now, the JDBC IO is basically a {{DoFn}} executed with a {{ParDo}}. So, it 
means that parallelism is "limited" and executed on one executor.
We can imagine to create several JDBC {{BoundedSource}}s splitting the SQL 
query in  subset (for instance using row id paging or any "splitting/limit" we 
can figure based on the original SQL query) (something similar to what Sqoop is 
doing).



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to