[
https://issues.apache.org/jira/browse/FLINK-12801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jark Wu closed FLINK-12801.
---------------------------
Resolution: Fixed
Assignee: xupingyong
Fix Version/s: 1.9.0
Fixed in 1.9.0: 6f5425fc9798510fef33ccd7bb81d4b9f59bffa5
> Set parallelism for batch SQL
> -----------------------------
>
> Key: FLINK-12801
> URL: https://issues.apache.org/jira/browse/FLINK-12801
> Project: Flink
> Issue Type: Task
> Components: Table SQL / Planner
> Reporter: XuPingyong
> Assignee: xupingyong
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.9.0
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> DataStream users can set parallelism via
> SingleOutputStreamOperator#setParallelism and DataStreamSink#setParallelism.
> SQL users, however, cannot set parallelism on individual operators, even
> though the JobGraphs compiled from SQL are usually complex.
> As a first step, we set parallelism for batch SQL through configuration. We
> introduce two resourceSetting modes:
> InferMode.NONE: Users can set the parallelism of sources, sinks, and other
> nodes separately.
> InferMode.ONLY_SOURCE: Same as InferMode.NONE, except that source
> parallelism can be inferred from the source row count.
> We also introduce ShuffleStage, which assigns the same parallelism to
> adjacent operators that have no data shuffle between them.
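
For illustration only, here is a rough Python model of the two modes and of the ShuffleStage grouping described above. It is not the planner code from the fix commit. The Node class, the function names, the default values, the rows-per-partition inference rule, and the rule that a stage takes its parallelism from the source or sink it contains are all assumptions made for this sketch.

```python
import math
from dataclasses import dataclass, field
from enum import Enum


class InferMode(Enum):
    NONE = "NONE"                # every parallelism comes from config
    ONLY_SOURCE = "ONLY_SOURCE"  # source parallelism inferred from row count


@dataclass
class Node:
    """Hypothetical plan node; not a Flink class."""
    name: str
    kind: str                    # "source", "sink", or "other"
    row_count: int = 0           # only meaningful for sources
    inputs: list = field(default_factory=list)  # (upstream Node, is_shuffle)


def build_shuffle_stages(nodes):
    """Group nodes that are linked by non-shuffle edges (union-find)."""
    parent = {n.name: n.name for n in nodes}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for n in nodes:
        for upstream, is_shuffle in n.inputs:
            if not is_shuffle:
                parent[find(n.name)] = find(upstream.name)

    stages = {}
    for n in nodes:
        stages.setdefault(find(n.name), []).append(n)
    return list(stages.values())


def assign_parallelism(nodes, mode, source_p=2, sink_p=1, default_p=4,
                       rows_per_partition=1_000_000, max_source_p=8):
    """Give every node in a stage the same parallelism (assumed rules)."""
    result = {}
    for stage in build_shuffle_stages(nodes):
        sources = [n for n in stage if n.kind == "source"]
        if sources and mode is InferMode.ONLY_SOURCE:
            # Assumed inference rule: one subtask per rows_per_partition rows.
            p = max(
                min(max_source_p,
                    max(1, math.ceil(n.row_count / rows_per_partition)))
                for n in sources
            )
        elif sources:
            p = source_p
        elif any(n.kind == "sink" for n in stage):
            p = sink_p
        else:
            p = default_p
        for n in stage:
            result[n.name] = p
    return result


# scan -> calc (no shuffle) => agg (shuffle) -> sort (no shuffle) => sink (shuffle)
scan = Node("scan", "source", row_count=2_500_000)
calc = Node("calc", "other", inputs=[(scan, False)])
agg = Node("agg", "other", inputs=[(calc, True)])
sort = Node("sort", "other", inputs=[(agg, False)])
sink = Node("sink", "sink", inputs=[(sort, True)])
plan = [scan, calc, agg, sort, sink]

print(assign_parallelism(plan, InferMode.NONE))
print(assign_parallelism(plan, InferMode.ONLY_SOURCE))
```

In this toy plan, scan and calc form one stage and agg and sort form another, so each pair always ends up with the same parallelism. Switching to ONLY_SOURCE changes only the stage that contains the source.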