AFAIK there is no theoretical limit on the size of the plan, it just depends on the available resources.

The job submissions times out since it takes too long to deploy all the operators that the job defines. With 300 flows, each with 6 operators you're looking at potentially (1800 * parallelism) tasks that have to be deployed. For each task Flink copies the user-code of /all/ flows to the executing TaskManager, which the network may just not be handle in time.

I suggest to split your job into smaller batches or even run each of them independently.

On 31.10.2017 16:25, Chan, Regina wrote:

Asking an additional question, what is the largest plan that the JobManager can handle? Is there a limit? My flows don’t need to run in parallel and can run independently. I wanted them to run in one single job because it’s part of one logical commit on my side.

Thanks,

Regina

*From:*Chan, Regina [Tech]
*Sent:* Monday, October 30, 2017 3:22 PM
*To:* 'user@flink.apache.org'
*Subject:* Job Manager Configuration

Flink Users,

I have about 300 parallel flows in one job each with 2 inputs, 3 operators, and 1 sink which makes for a large job. I keep getting the below timeout exception but I’ve already set it to a 30 minute time out with a 6GB heap on the JobManager? Is there a heuristic to better configure the job manager?

Caused by: org.apache.flink.runtime.client.JobClientActorSubmissionTimeoutException: Job submission to the JobManager timed out. You may increase 'akka.client.timeout' in case the JobManager needs more time to configure and confirm the job submission.

*Regina Chan*

*Goldman Sachs**–*Enterprise Platforms, Data Architecture

*30 Hudson Street, 37th floor | Jersey City, NY 07302*((212) 902-5697**


Reply via email to