[ 
https://issues.apache.org/jira/browse/BEAM-12135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ismaël Mejía updated BEAM-12135:
--------------------------------
    Description: The Spark Runner, and possibly every other runner that deals 
with batch-only data, might benefit from a batch-optimized translation in which 
the details of the full Beam model matter less: all elements are in the Global 
window, no pane info is needed, and timestamps are at the minimum value. On 
this premise, records can be encoded as 'value only' WindowedValues, and 
transforms like GroupByKey may ignore windowing (GABW) to improve performance.  
(was: The Spark Runner, and possibly every other runner that deals with 
batch-only data, might benefit from a batch-optimized translation in which 
internal records are encoded as 'value only' WindowedValues and operations like 
GroupByKey may ignore windowing (GABW) to improve performance.)

> Batch optimized translation for Spark Runner
> --------------------------------------------
>
>                 Key: BEAM-12135
>                 URL: https://issues.apache.org/jira/browse/BEAM-12135
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-spark
>            Reporter: Ismaël Mejía
>            Priority: P2
>
> The Spark Runner, and possibly every other runner that deals with batch-only 
> data, might benefit from a batch-optimized translation in which the details 
> of the full Beam model matter less: all elements are in the Global window, no 
> pane info is needed, and timestamps are at the minimum value. On this 
> premise, records can be encoded as 'value only' WindowedValues, and 
> transforms like GroupByKey may ignore windowing (GABW) to improve performance.
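
The idea in the description can be sketched roughly as follows. This is a minimal, self-contained illustration and not Beam or Spark Runner code: the `WindowedValue` class, the constants, the byte layout, and the helper functions are all hypothetical stand-ins, assumed here only to show why dropping window, pane, and timestamp metadata shrinks each record and lets grouping ignore windows.

```python
import struct
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical placeholders, NOT Beam's actual constants or encodings.
MIN_TIMESTAMP = -(2**63)
GLOBAL_WINDOW = "GlobalWindow"
NO_PANE = 0


@dataclass(frozen=True)
class WindowedValue:
    """Toy stand-in for a windowed record (assumed shape, not Beam's class)."""
    value: bytes
    timestamp: int = MIN_TIMESTAMP
    window: str = GLOBAL_WINDOW
    pane: int = NO_PANE


def encode_full(wv: WindowedValue) -> bytes:
    """Encode value plus timestamp, pane, and window (invented layout)."""
    window = wv.window.encode("utf-8")
    header = struct.pack(">qBH", wv.timestamp, wv.pane, len(window))
    return header + window + wv.value


def encode_value_only(wv: WindowedValue) -> bytes:
    """Encode only the value; the metadata is implied by the batch premise."""
    return wv.value


def decode_value_only(data: bytes) -> WindowedValue:
    """Rebuild the record, restoring the implied global-window defaults."""
    return WindowedValue(value=data)


def group_by_key_global(pairs):
    """Group values by key without any per-window bookkeeping."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)
```

Under these assumptions, the value-only encoding round-trips losslessly because every omitted field has a single known value in batch mode, and the grouping step reduces to a plain per-key aggregation.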



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
