[ 
https://issues.apache.org/jira/browse/FLINK-7914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16220136#comment-16220136
 ] 

ASF GitHub Bot commented on FLINK-7914:
---------------------------------------

Github user tillrohrmann commented on the issue:

    https://github.com/apache/flink/pull/4903
  
    Akka's default value is actually 5 seconds, which I think is a bit too high.
    
    I actually tried to backtrack an instability in the `JobManagerFailsITCase` 
and noticed that this test took roughly 16 s to execute (the ITCase contains 
only 2 tests where we restart the JM). Part of the reason was that Akka gated 
the JobManager ActorSystem for 5 seconds after we let the JM fail.
    
    The actual solution to speed up this test was then to don't reuse the same 
port for the new JobManager system, but I couldn't think of a good reason to 
keep the 5 seconds default. Moreover, some other tests which also run into the 
case of gated connections could also benefit from that. I think lowering the 
gated interval should allow us to reestablish a lost connection faster.


> Expose Akka gated interval as user option
> -----------------------------------------
>
>                 Key: FLINK-7914
>                 URL: https://issues.apache.org/jira/browse/FLINK-7914
>             Project: Flink
>          Issue Type: Improvement
>          Components: Distributed Coordination
>    Affects Versions: 1.4.0
>            Reporter: Till Rohrmann
>            Assignee: Till Rohrmann
>            Priority: Minor
>
> Whenever Akka loses its connection to a remote {{ActorSystem}} it gates the 
> corresponding address. The default value is {{5 s}}. Especially for tests 
> this can be too high. Therefore, I propose to expose this option to the user 
> via the {{AkkaOptions}} and setting it to {{50 ms}} per default.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to