[
https://issues.apache.org/jira/browse/BEAM-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17133022#comment-17133022
]
Lai Zhou commented on BEAM-2026:
--------------------------------
I'll try to make a high performance direct runner, which would have these
features:
1. WholeStageCodegen
2. beam sql plan cache
Lets say that we can convert a PTransform to a 'WholeStageCodegen' java class ,
which will consume PInput data, and produce POutput data.
> High performance direct runner
> ------------------------------
>
> Key: BEAM-2026
> URL: https://issues.apache.org/jira/browse/BEAM-2026
> Project: Beam
> Issue Type: New Feature
> Components: runner-direct
> Reporter: Mitar
> Assignee: Mitar
> Priority: P2
> Labels: stale-assigned
>
> In documentation (https://beam.apache.org/documentation/runners/direct/) it
> is written that direct runner does not try to run efficiently, but it serves
> mostly for development and debugging.
> I would suggest that there should be also an efficient direct runner. If Beam
> tries to be an unified programming model, for some smaller tasks I would love
> to implement them in Beam, just to keep the code in the same model, but it
> would be OK to run it as a normal smaller program (maybe inside one Docker
> container), without any distribution across multiple machines. In the future,
> if usage grows, I could then replace underlying runner with something
> distributed.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)