[ 
https://issues.apache.org/jira/browse/BEAM-10670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17334972#comment-17334972
 ] 

Kenneth Knowles commented on BEAM-10670:
----------------------------------------

So the idea is this:

 - Every BoundedSource-based Read transform should still have Read in its 
expansion. I don't know if this was changed. The SDK should be able to build 
one graph and have it work optimally on runners based on their preferences.
 - Inside that expansion there may be SDF, but any runner can ignore it, or 
have a flag to enable it. Again, I don't think the SDK should change its 
expansion.

Can you itemize where this does not work? Was the SparkRunner changed? Were the 
IOs made incompatible?

> Make non-portable Splittable DoFn the only option when executing Java "Read" 
> transforms
> ---------------------------------------------------------------------------------------
>
>                 Key: BEAM-10670
>                 URL: https://issues.apache.org/jira/browse/BEAM-10670
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-java-core
>            Reporter: Luke Cwik
>            Assignee: Kenneth Knowles
>            Priority: P3
>              Labels: Clarified
>             Fix For: 2.30.0
>
>          Time Spent: 37h 50m
>  Remaining Estimate: 0h
>
> All runners seem to be capable of migrating to splittable DoFn for 
> non-portable execution except for Dataflow runner v1 which will internalize 
> the current primitive read implementation that is shared across runner 
> implementations.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to