[jira] [Commented] (BEAM-1631) Flink runner: submit job to a Flink-on-YARN cluster

Aljoscha Krettek (JIRA) Tue, 18 Apr 2017 05:49:42 -0700

    [ 
https://issues.apache.org/jira/browse/BEAM-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15972593#comment-15972593
 ]


Aljoscha Krettek commented on BEAM-1631:
----------------------------------------

Yes, I think we would still need to have a {{HADOOP_CONF_DIR}} set. I think 
right now it's not possible to solve this in a nice way. If was thinking of 
pointing the Flink Runner to the directory of a Flink installation and then use 
the normal Flink submission script from Java to submit the Job Jar. That's the 
easiest solution I could come up with but it's quite hacky.

There is stuff happening around FLINK-6 
(https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=65147077) 
which will introduce the notion of a dispatcher that will accept jobs from 
clients and then bring up a per-job cluster on YARN (or other cluster 
management systems). I think once we have this job submission on the Flink 
Runner should be very smooth.

> Flink runner: submit job to a Flink-on-YARN cluster
> ---------------------------------------------------
>
>                 Key: BEAM-1631
>                 URL: https://issues.apache.org/jira/browse/BEAM-1631
>             Project: Beam
>          Issue Type: New Feature
>          Components: runner-flink
>            Reporter: Davor Bonaci
>            Assignee: Aljoscha Krettek
>
> As far as I understand, running Beam pipelines on a Flink cluster can be done 
> in two ways:
> * Run directly with a Flink runner, and specifying {{--flinkMaster}} pipeline 
> option via, say, {{mvn exec}}.
> * Produce a bundled JAR, and use {{bin/flink}} to submit the same pipeline.
> These two ways are equivalent, and work well on a standalone Flink cluster.
> Submitting to a Flink-on-YARN is more complicated. You can still produce a 
> bundled JAR, and use {{bin/flink -yid <applicationid>}} to submit such a job. 
> However, that seems impossible with a Flink runner directly.
> If so, we should add the ability to the Flink runner to submit a job to a 
> Flink-on-YARN cluster directly.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (BEAM-1631) Flink runner: submit job to a Flink-on-YARN cluster

Reply via email to