This is an automated email from the ASF dual-hosted git repository.
xtsong pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/flink.git
The following commit(s) were added to refs/heads/master by this push:
new 4a7db1c [FLINK-22335][runtime][config] Increase default resource wait
timeout for the adaptive scheduler.
4a7db1c is described below
commit 4a7db1c6b81d17eccb8ae582f0c94e006392cbec
Author: Xintong Song <[email protected]>
AuthorDate: Mon Apr 19 11:24:25 2021 +0800
[FLINK-22335][runtime][config] Increase default resource wait timeout for
the adaptive scheduler.
This closes #15657
---
docs/layouts/shortcodes/generated/all_jobmanager_section.html | 4 ++--
docs/layouts/shortcodes/generated/expert_scheduling_section.html | 4 ++--
docs/layouts/shortcodes/generated/job_manager_configuration.html | 4 ++--
.../main/java/org/apache/flink/configuration/JobManagerOptions.java | 2 +-
4 files changed, 7 insertions(+), 7 deletions(-)
diff --git a/docs/layouts/shortcodes/generated/all_jobmanager_section.html
b/docs/layouts/shortcodes/generated/all_jobmanager_section.html
index 5d4666c..d542cab 100644
--- a/docs/layouts/shortcodes/generated/all_jobmanager_section.html
+++ b/docs/layouts/shortcodes/generated/all_jobmanager_section.html
@@ -16,13 +16,13 @@
</tr>
<tr>
<td><h5>jobmanager.adaptive-scheduler.resource-stabilization-timeout</h5></td>
- <td style="word-wrap: break-word;">10 s</td>
+ <td style="word-wrap: break-word;">5 min</td>
<td>Duration</td>
<td>The resource stabilization timeout defines the time the
JobManager will wait if fewer than the desired but sufficient resources are
available. The timeout starts once sufficient resources for running the job are
available. Once this timeout has passed, the job will start executing with the
available resources.<br />If <code
class="highlighter-rouge">scheduler-mode</code> is configured to <code
class="highlighter-rouge">REACTIVE</code>, this configuration value will defaul
[...]
</tr>
<tr>
<td><h5>jobmanager.adaptive-scheduler.resource-wait-timeout</h5></td>
- <td style="word-wrap: break-word;">10 s</td>
+ <td style="word-wrap: break-word;">5 min</td>
<td>Duration</td>
<td>The maximum time the JobManager will wait to acquire all
required resources after a job submission or restart. Once elapsed it will try
to run the job with a lower parallelism, or fail if the minimum amount of
resources could not be acquired.<br />Increasing this value will make the
cluster more resilient against temporary resources shortages (e.g., there is
more time for a failed TaskManager to be restarted).<br />Setting a negative
duration will disable the resource tim [...]
</tr>
diff --git a/docs/layouts/shortcodes/generated/expert_scheduling_section.html
b/docs/layouts/shortcodes/generated/expert_scheduling_section.html
index 409ecb9..3e7807b 100644
--- a/docs/layouts/shortcodes/generated/expert_scheduling_section.html
+++ b/docs/layouts/shortcodes/generated/expert_scheduling_section.html
@@ -22,13 +22,13 @@
</tr>
<tr>
<td><h5>jobmanager.adaptive-scheduler.resource-stabilization-timeout</h5></td>
- <td style="word-wrap: break-word;">10 s</td>
+ <td style="word-wrap: break-word;">5 min</td>
<td>Duration</td>
<td>The resource stabilization timeout defines the time the
JobManager will wait if fewer than the desired but sufficient resources are
available. The timeout starts once sufficient resources for running the job are
available. Once this timeout has passed, the job will start executing with the
available resources.<br />If <code
class="highlighter-rouge">scheduler-mode</code> is configured to <code
class="highlighter-rouge">REACTIVE</code>, this configuration value will defaul
[...]
</tr>
<tr>
<td><h5>jobmanager.adaptive-scheduler.resource-wait-timeout</h5></td>
- <td style="word-wrap: break-word;">10 s</td>
+ <td style="word-wrap: break-word;">5 min</td>
<td>Duration</td>
<td>The maximum time the JobManager will wait to acquire all
required resources after a job submission or restart. Once elapsed it will try
to run the job with a lower parallelism, or fail if the minimum amount of
resources could not be acquired.<br />Increasing this value will make the
cluster more resilient against temporary resources shortages (e.g., there is
more time for a failed TaskManager to be restarted).<br />Setting a negative
duration will disable the resource tim [...]
</tr>
diff --git a/docs/layouts/shortcodes/generated/job_manager_configuration.html
b/docs/layouts/shortcodes/generated/job_manager_configuration.html
index 2a82ba6..619e86e 100644
--- a/docs/layouts/shortcodes/generated/job_manager_configuration.html
+++ b/docs/layouts/shortcodes/generated/job_manager_configuration.html
@@ -16,13 +16,13 @@
</tr>
<tr>
<td><h5>jobmanager.adaptive-scheduler.resource-stabilization-timeout</h5></td>
- <td style="word-wrap: break-word;">10 s</td>
+ <td style="word-wrap: break-word;">5 min</td>
<td>Duration</td>
<td>The resource stabilization timeout defines the time the
JobManager will wait if fewer than the desired but sufficient resources are
available. The timeout starts once sufficient resources for running the job are
available. Once this timeout has passed, the job will start executing with the
available resources.<br />If <code
class="highlighter-rouge">scheduler-mode</code> is configured to <code
class="highlighter-rouge">REACTIVE</code>, this configuration value will defaul
[...]
</tr>
<tr>
<td><h5>jobmanager.adaptive-scheduler.resource-wait-timeout</h5></td>
- <td style="word-wrap: break-word;">10 s</td>
+ <td style="word-wrap: break-word;">5 min</td>
<td>Duration</td>
<td>The maximum time the JobManager will wait to acquire all
required resources after a job submission or restart. Once elapsed it will try
to run the job with a lower parallelism, or fail if the minimum amount of
resources could not be acquired.<br />Increasing this value will make the
cluster more resilient against temporary resources shortages (e.g., there is
more time for a failed TaskManager to be restarted).<br />Setting a negative
duration will disable the resource tim [...]
</tr>
diff --git
a/flink-core/src/main/java/org/apache/flink/configuration/JobManagerOptions.java
b/flink-core/src/main/java/org/apache/flink/configuration/JobManagerOptions.java
index 2927d9f..4cb52e5 100644
---
a/flink-core/src/main/java/org/apache/flink/configuration/JobManagerOptions.java
+++
b/flink-core/src/main/java/org/apache/flink/configuration/JobManagerOptions.java
@@ -401,7 +401,7 @@ public class JobManagerOptions {
public static final ConfigOption<Duration> RESOURCE_WAIT_TIMEOUT =
key("jobmanager.adaptive-scheduler.resource-wait-timeout")
.durationType()
- .defaultValue(Duration.ofSeconds(10))
+ .defaultValue(Duration.ofMinutes(5))
.withDescription(
Description.builder()
.text(