[ 
https://issues.apache.org/jira/browse/BEAM-14161?focusedWorklogId=754428&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-754428
 ]

ASF GitHub Bot logged work on BEAM-14161:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 08/Apr/22 05:57
            Start Date: 08/Apr/22 05:57
    Worklog Time Spent: 10m 
      Work Description: aaltay commented on PR #16863:
URL: https://github.com/apache/beam/pull/16863#issuecomment-1092468127

   @pabloem - Would you be able to respond to the open comments?




Issue Time Tracking
-------------------

    Worklog Id:     (was: 754428)
    Time Spent: 0.5h  (was: 20m)

> Add dynamic splitting to JdbcIO.readWithPartitions
> --------------------------------------------------
>
>                 Key: BEAM-14161
>                 URL: https://issues.apache.org/jira/browse/BEAM-14161
>             Project: Beam
>          Issue Type: Improvement
>          Components: io-java-jdbc
>            Reporter: Pablo Estrada
>            Assignee: Jean-Baptiste Onofré
>            Priority: P2
>             Fix For: Not applicable
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Currently, JdbcIO is essentially a {{DoFn}} executed with a {{ParDo}}, so
> parallelism is limited and the read runs on a single executor.
> ReadWithPartitions does some preliminary partitioning of the data, but any
> skew in the data range or workload will still produce unbalanced work.
>  
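
To illustrate the skew concern in the description above, here is a minimal, self-contained sketch. It assumes a static scheme that cuts a numeric key range into equal-width partitions up front; this is an illustrative assumption, not a copy of how JdbcIO.readWithPartitions is implemented. The class name, method names, and the sample key set are all hypothetical.

```java
import java.util.Arrays;

public class RangePartitionSkew {

    // Hypothetical skewed key set: 90 dense keys (1..90) plus 10 sparse keys (100, 200, ..., 1000).
    static long[] sampleKeys() {
        long[] keys = new long[100];
        for (int i = 0; i < 90; i++) {
            keys[i] = i + 1;
        }
        for (int i = 0; i < 10; i++) {
            keys[90 + i] = (i + 1) * 100L;
        }
        return keys;
    }

    // Counts how many keys land in each of numPartitions equal-width ranges covering [lower, upper].
    static long[] countPerPartition(long[] keys, long lower, long upper, int numPartitions) {
        long[] counts = new long[numPartitions];
        long span = upper - lower + 1;
        // Round the width up so the last partition still ends at or after upper.
        long width = (span + numPartitions - 1) / numPartitions;
        for (long key : keys) {
            int idx = (int) ((key - lower) / width);
            counts[idx]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        long[] counts = countPerPartition(sampleKeys(), 1, 1000, 4);
        // Prints [92, 3, 2, 3]: one partition holds 92 of the 100 rows.
        System.out.println(Arrays.toString(counts));
    }
}
```

With ranges fixed in advance, the worker that receives the first partition does nearly all of the reading. Dynamic splitting, as requested in this issue, would let a runner subdivide such a range while it is being processed.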



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
