[jira] [Commented] (BEAM-2026) High performance direct runner

Lai Zhou (Jira) Thu, 11 Jun 2020 00:51:16 -0700


    [ 
https://issues.apache.org/jira/browse/BEAM-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17133022#comment-17133022
 ]


Lai Zhou commented on BEAM-2026:
--------------------------------

I'll try to build a high-performance direct runner with these features:
1. WholeStageCodegen
2. Beam SQL plan cache

Let's say we can convert a PTransform into a 'WholeStageCodegen' Java class
that consumes PInput data and produces POutput data.
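
To make the idea concrete, here is a rough sketch in plain Java with no Beam
dependency. Every name in it (Stage, FilterThenMapStage, PlanCache,
getOrCompile) is hypothetical and only illustrates the shape of the proposal;
none of it is Beam API. It assumes a generated stage fuses a filter and a map
into one loop, and that a plan cache keyed by query text lets repeated queries
reuse the already built stage.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

public class FusedStageSketch {

    /** Hypothetical stand-in for a generated stage: consumes input, produces output. */
    interface Stage<I, O> {
        List<O> process(List<I> input);
    }

    /**
     * What a code generator might emit for "keep even numbers, then multiply by 10".
     * Both steps run in a single loop, so no intermediate collection is materialized.
     */
    static final class FilterThenMapStage implements Stage<Integer, Integer> {
        @Override
        public List<Integer> process(List<Integer> input) {
            List<Integer> output = new ArrayList<>();
            for (int value : input) {
                if (value % 2 == 0) {        // fused filter step
                    output.add(value * 10);  // fused map step
                }
            }
            return output;
        }
    }

    /** Hypothetical plan cache: query text -> built stage, so repeats skip the build. */
    static final class PlanCache {
        private final Map<String, Stage<Integer, Integer>> cache = new ConcurrentHashMap<>();
        private final AtomicInteger builds = new AtomicInteger();

        Stage<Integer, Integer> getOrCompile(
                String query, Function<String, Stage<Integer, Integer>> compiler) {
            // The compiler function runs only on a cache miss.
            return cache.computeIfAbsent(query, q -> {
                builds.incrementAndGet();
                return compiler.apply(q);
            });
        }

        int buildCount() {
            return builds.get();
        }
    }

    public static void main(String[] args) {
        PlanCache cache = new PlanCache();
        String query = "SELECT x * 10 FROM t WHERE x % 2 = 0"; // illustrative text only

        // Look the same query up twice; the second call is served from the cache.
        Stage<Integer, Integer> first = cache.getOrCompile(query, q -> new FilterThenMapStage());
        Stage<Integer, Integer> second = cache.getOrCompile(query, q -> new FilterThenMapStage());

        System.out.println(first.process(List.of(1, 2, 3, 4, 5))); // prints [20, 40]
        System.out.println(first == second);                       // prints true
        System.out.println(cache.buildCount());                    // prints 1
    }
}
```

In a real runner the stage would be generated and compiled at runtime from the
pipeline or SQL plan rather than written by hand; the hand-written class above
only stands in for that generated output.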


> High performance direct runner
> ------------------------------
>
>                 Key: BEAM-2026
>                 URL: https://issues.apache.org/jira/browse/BEAM-2026
>             Project: Beam
>          Issue Type: New Feature
>          Components: runner-direct
>            Reporter: Mitar
>            Assignee: Mitar
>            Priority: P2
>              Labels: stale-assigned
>
> The documentation (https://beam.apache.org/documentation/runners/direct/) 
> says that the direct runner does not try to run efficiently and serves 
> mostly for development and debugging.
> I would suggest that there should also be an efficient direct runner. If Beam 
> aims to be a unified programming model, I would love to implement some 
> smaller tasks in Beam, just to keep the code in the same model, while still 
> running them as a normal small program (maybe inside one Docker container), 
> without any distribution across multiple machines. In the future, if usage 
> grows, I could then replace the underlying runner with something 
> distributed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
