Github user tillrohrmann commented on the issue:
https://github.com/apache/flink/pull/4903
Akka's default value is actually 5 seconds, which I think is a bit too high.
I actually tried to backtrack an instability in the `JobManagerFailsITCase`
and noticed that this test took roughly 16 s to execute (the ITCase contains
only 2 tests where we restart the JM). Part of the reason was that Akka gated
the JobManager ActorSystem for 5 seconds after we let the JM fail.
The actual solution to speed up this test was then to don't reuse the same
port for the new JobManager system, but I couldn't think of a good reason to
keep the 5 seconds default. Moreover, some other tests which also run into the
case of gated connections could also benefit from that. I think lowering the
gated interval should allow us to reestablish a lost connection faster.
---