[
https://issues.apache.org/jira/browse/BEAM-14449?focusedWorklogId=775263&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-775263
]
ASF GitHub Bot logged work on BEAM-14449:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 26/May/22 22:16
Start Date: 26/May/22 22:16
Worklog Time Spent: 10m
Work Description: KevinGG commented on code in PR #17736:
URL: https://github.com/apache/beam/pull/17736#discussion_r883122855
##########
sdks/python/apache_beam/runners/interactive/interactive_beam.py:
##########
@@ -475,7 +485,9 @@ def cleanup(
'options is deprecated since First stable release. References to '
'<pipeline>.options will not be supported',
category=DeprecationWarning)
- p.options.view_as(FlinkRunnerOptions).flink_master = '[auto]'
+ p_flink_options = p.options.view_as(FlinkRunnerOptions)
+ p_flink_options.flink_master = '[auto]'
+ p_flink_options.flink_version = None
Review Comment:
Dataproc supports only a single, fixed Flink version, and we always override
flink_version to that version when running on Dataproc.
We reset it to None here in case the user later wants to use a different
cluster but forgets to set the Flink version they want to use.
Beam's default value is the latest published Flink version, which is hard-coded.
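
The reasoning above can be sketched in a few lines. This is a minimal
illustration, not the code in the PR: FlinkOptions is a hypothetical stand-in
for Beam's FlinkRunnerOptions, and both version constants are placeholders
rather than real Flink version numbers.

```python
from dataclasses import dataclass
from typing import Optional

# Placeholders for illustration only; neither is a real Flink version string.
DATAPROC_PINNED_VERSION = 'dataproc-pinned'
BEAM_DEFAULT_VERSION = 'beam-default'


@dataclass
class FlinkOptions:
    """Hypothetical stand-in for the two FlinkRunnerOptions fields in the diff."""
    flink_master: str = '[auto]'
    flink_version: Optional[str] = None


def configure_for_dataproc(opts: FlinkOptions, master_url: str) -> None:
    # On Dataproc the version is always overridden to the one the cluster supports.
    opts.flink_master = master_url
    opts.flink_version = DATAPROC_PINNED_VERSION


def reset_after_cleanup(opts: FlinkOptions) -> None:
    # Mirrors the diff: clear both fields so no Dataproc-specific value leaks
    # into a later run against a different cluster.
    opts.flink_master = '[auto]'
    opts.flink_version = None


def effective_version(opts: FlinkOptions) -> str:
    # With no explicit version, the (assumed) default applies.
    return opts.flink_version or BEAM_DEFAULT_VERSION
```

Without the reset, a user who switches to their own cluster and forgets to set
a version would silently keep the Dataproc override instead of getting the
default.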
Issue Time Tracking
-------------------
Worklog Id: (was: 775263)
Time Spent: 1h 10m (was: 1h)
> Support cluster provisioning when using Flink on Dataproc
> ---------------------------------------------------------
>
> Key: BEAM-14449
> URL: https://issues.apache.org/jira/browse/BEAM-14449
> Project: Beam
> Issue Type: Improvement
> Components: runner-py-interactive
> Reporter: Ning
> Assignee: Ning
> Priority: P2
> Attachments: image-2022-05-16-11-25-32-904.png,
> image-2022-05-16-11-28-12-702.png
>
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> Provide the capability for the user to explicitly provision a cluster.
> The current implementation provisions each cluster at the location specified
> by GoogleCloudOptions using 3 worker nodes. There is no explicit API to
> configure the number or shape of workers.
> We could use WorkerOptions to let customers explicitly provision a cluster,
> and expose an explicit API (with UX in the notebook extension) for customers
> to change the size of a cluster connected to their notebook (until we have
> an autoscaling solution with Dataproc for Flink).
> The API for configuring the workers of a Dataproc cluster at creation time
> looks like this:
> !image-2022-05-16-11-25-32-904.png!
> An example request setting the masterConfig and workerConfig:
> !image-2022-05-16-11-28-12-702.png!
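
Since the attached screenshots are not reproduced in this message, here is a
rough sketch of what a request body setting masterConfig and workerConfig
might look like. The field names follow the Dataproc REST API's JSON naming;
the helper build_cluster_request and its parameters are hypothetical and are
not part of Beam or of the PR. Using one master and the same machine type for
both groups is a simplifying assumption.

```python
from typing import Any, Dict


def build_cluster_request(
    cluster_name: str, num_workers: int, machine_type: str) -> Dict[str, Any]:
    """Builds a cluster-creation body from worker settings (hypothetical helper)."""
    if num_workers < 1:
        raise ValueError('num_workers must be a positive integer')
    return {
        'clusterName': cluster_name,
        'config': {
            # Assumption: a single master using the same machine type as the workers.
            'masterConfig': {
                'numInstances': 1,
                'machineTypeUri': machine_type,
            },
            'workerConfig': {
                'numInstances': num_workers,
                'machineTypeUri': machine_type,
            },
        },
    }
```

Values such as the worker count and machine type could be read from
WorkerOptions, as the description proposes, instead of being fixed.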