[
https://issues.apache.org/jira/browse/FLINK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14505596#comment-14505596
]
ASF GitHub Bot commented on FLINK-1867:
---------------------------------------
Github user StephanEwen commented on a diff in the pull request:
https://github.com/apache/flink/pull/612#discussion_r28814574
--- Diff:
flink-tests/src/test/java/org/apache/flink/test/recovery/AbstractProcessFailureRecoveryTest.java
---
@@ -112,9 +112,9 @@ public void testTaskManagerProcessFailure() {
Tuple2<String, Object> localAddress = new
Tuple2<String, Object>("localhost", jobManagerPort);
Configuration jmConfig = new Configuration();
-
jmConfig.setString(ConfigConstants.AKKA_WATCH_HEARTBEAT_INTERVAL, "1 s");
-
jmConfig.setString(ConfigConstants.AKKA_WATCH_HEARTBEAT_PAUSE, "4 s");
-
jmConfig.setInteger(ConfigConstants.AKKA_WATCH_THRESHOLD, 2);
+
jmConfig.setString(ConfigConstants.AKKA_WATCH_HEARTBEAT_INTERVAL, "1 ms");
+
jmConfig.setString(ConfigConstants.AKKA_WATCH_HEARTBEAT_PAUSE, "20 s");
+
jmConfig.setInteger(ConfigConstants.AKKA_WATCH_THRESHOLD, 20);
--- End diff --
How long do the tests take now? With a pause of 20s and a threshold of 20,
how long does the JobManager take to realize that the TaskManager is down?
> TaskManagerFailureRecoveryITCase causes stalled travis builds
> -------------------------------------------------------------
>
> Key: FLINK-1867
> URL: https://issues.apache.org/jira/browse/FLINK-1867
> Project: Flink
> Issue Type: Bug
> Components: TaskManager, Tests
> Affects Versions: 0.9
> Reporter: Robert Metzger
> Assignee: Aljoscha Krettek
>
> There are currently tests on travis failing:
> https://travis-ci.org/apache/flink/jobs/57943063
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)