[ 
https://issues.apache.org/jira/browse/FLINK-8137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joshua Griffith updated FLINK-8137:
-----------------------------------
    Description: 
When submitting a new {{FlinkPlan}} using the {{StandaloneClusterClient}}, the 
JobManager's REST API appears to hang until the job is submitted. The 
submission time may be large enough to cause timeouts if the {{getStatistics}} 
and {{createInputSplits}} methods of a job's {{RichInputFormat}} s perform 
time-intensive tasks like running external queries. This is exacerbated when a 
job contains many such input formats, since they appear to be initialized 
sequentially. For a particular job with over 100 inputs, it's typical for the 
API (and consequently the web UI) to be non-responsive for 45–60 seconds.

Would it make sense for tasks to have a {{Configuring}} state before the 
{{Created}} state to provide greater visibility and indicate that the 
JobManager is still healthy?

  was:
When submitting a new {{FlinkPlan}} using the {{StandaloneClusterClient}}, the 
JobManager's REST API appears to hang until the job is submitted. The 
submission time may be large enough to cause timeouts if the {{getStatistics}} 
and {{createInputSplits}} methods of a job's {{RichInputFormat}}s perform 
time-intensive tasks like running external queries. This is exacerbated when a 
job contains many such input formats, since they appear to be initialized 
sequentially. For a particular job with over 100 inputs, it's typical for the 
API (and consequently the web UI) to be non-responsive for 45–60 seconds.

Would it make sense for tasks to have a {{Configuring}} state before the 
{{Created}} state to provide greater visibility and indicate that the 
JobManager is still healthy?


> Flink JobManager API non-responsive during job submission
> ---------------------------------------------------------
>
>                 Key: FLINK-8137
>                 URL: https://issues.apache.org/jira/browse/FLINK-8137
>             Project: Flink
>          Issue Type: Bug
>          Components: Client, Job-Submission, JobManager, REST, Webfrontend
>    Affects Versions: 1.3.2
>         Environment: Flink 1.3.2 running a batch job in Kubernetes.
>            Reporter: Joshua Griffith
>            Priority: Minor
>
> When submitting a new {{FlinkPlan}} using the {{StandaloneClusterClient}}, 
> the JobManager's REST API appears to hang until the job is submitted. The 
> submission time may be large enough to cause timeouts if the 
> {{getStatistics}} and {{createInputSplits}} methods of a job's 
> {{RichInputFormat}} s perform time-intensive tasks like running external 
> queries. This is exacerbated when a job contains many such input formats, 
> since they appear to be initialized sequentially. For a particular job with 
> over 100 inputs, it's typical for the API (and consequently the web UI) to be 
> non-responsive for 45–60 seconds.
>
> Would it make sense for tasks to have a {{Configuring}} state before the 
> {{Created}} state to provide greater visibility and indicate that the 
> JobManager is still healthy?
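
The arithmetic behind the reported delay can be sketched without any Flink dependency. The snippet below is only an illustration: SlowInput is a hypothetical stand-in for a RichInputFormat, and both the 500 ms per-input cost and the count of exactly 100 inputs are assumptions chosen to land inside the reported 45–60 second range. It shows that when setup runs one input at a time, the time the caller stays blocked is the sum of the per-input costs.

```java
import java.util.ArrayList;
import java.util.List;

public class SequentialInitSketch {

    /** Hypothetical stand-in for an input format whose setup runs an external query. */
    interface SlowInput {
        /** Simulated cost in ms of getStatistics plus createInputSplits (assumed value). */
        long setupMillis();
    }

    /** Sums the setup costs the way a one-at-a-time loop would incur them. */
    static long sequentialBlockedMillis(List<SlowInput> inputs) {
        long total = 0;
        for (SlowInput input : inputs) {
            total += input.setupMillis();
        }
        return total;
    }

    public static void main(String[] args) {
        List<SlowInput> inputs = new ArrayList<>();
        // Assumption: 100 inputs, each taking about 500 ms to set up.
        for (int i = 0; i < 100; i++) {
            inputs.add(() -> 500L);
        }
        System.out.println(sequentialBlockedMillis(inputs) + " ms blocked");
    }
}
```

Under these assumed numbers the blocked time is 50 seconds, so even modest per-input latency would be enough to account for the unresponsiveness described above.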



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
