[jira] [Commented] (FLINK-5747) Eager Scheduling should deploy all Tasks together

ASF GitHub Bot (JIRA) Fri, 17 Feb 2017 04:40:32 -0800

    [ 
https://issues.apache.org/jira/browse/FLINK-5747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15871795#comment-15871795
 ]


ASF GitHub Bot commented on FLINK-5747:
---------------------------------------

Github user tillrohrmann commented on a diff in the pull request:

    https://github.com/apache/flink/pull/3295#discussion_r101737070
  
    --- Diff: 
flink-runtime/src/main/java/org/apache/flink/runtime/concurrent/FutureUtils.java
 ---
    @@ -88,4 +100,104 @@ public RetryException(Throwable cause) {
                        super(cause);
                }
        }
    +
    +   // 
------------------------------------------------------------------------
    +   //  composing futures
    +   // 
------------------------------------------------------------------------
    +
    +   /**
    +    * Creates a future that is complete once multiple other futures 
completed. 
    +    * The ConjunctFuture fails (completes exceptionally) once one of the 
Futures in the
    +    * conjunction fails.
    +    *
    +    * <p>The ConjunctFuture gives access to how many Futures in the 
conjunction have already
    +    * completed successfully, via {@link 
ConjunctFuture#getNumFuturesCompleted()}. 
    +    * 
    +    * @param futures The futures that make up the conjunction. No null 
entries are allowed.
    +    * @return The ConjunctFuture that completes once all given futures are 
complete (or one fails).
    +    */
    +   public static ConjunctFuture combineAll(Collection<? extends Future<?>> 
futures) {
    +           checkNotNull(futures, "futures");
    +           checkArgument(!futures.isEmpty(), "futures is empty");
    +
    +           final ConjunctFutureImpl conjunct = new 
ConjunctFutureImpl(futures.size());
    +
    +           for (Future<?> future : futures) {
    +                   future.handle(conjunct.completionHandler);
    +           }
    +
    +           return conjunct;
    +   }
    +
    +   /**
    +    * A future that is complete once multiple other futures completed. The 
futures are not
    +    * necessarily of the same type, which is why the type of this Future 
is {@code Void}.
    +    * The ConjunctFuture fails (completes exceptionally) once one of the 
Futures in the
    +    * conjunction fails.
    +    * 
    +    * <p>The advantage of using the ConjunctFuture over chaining all the 
futures (such as via
    +    * {@link Future#thenCombine(Future, BiFunction)}) is that 
ConjunctFuture also tracks how
    +    * many of the Futures are already complete.
    +    */
    +   public interface ConjunctFuture extends CompletableFuture<Void> {
    +
    +           /**
    +            * Gets the total number of Futures in the conjunction.
    +            * @return The total number of Futures in the conjunction.
    +            */
    +           int getNumFuturesTotal();
    +
    +           /**
    +            * Gets the number of Futures in the conjunction that are 
already complete.
    +            * @return The number of Futures in the conjunction that are 
already complete
    +            */
    +           int getNumFuturesCompleted();
    +   }
    +
    +   /**
    +    * The implementation of the {@link ConjunctFuture}.
    +    * 
    +    * <p>Implementation notice: The member fields all have package-private 
access, because they are
    +    * either accessed by an inner subclass or by the enclosing class.
    +    */
    +   private static class ConjunctFutureImpl extends 
FlinkCompletableFuture<Void> implements ConjunctFuture {
    --- End diff --
    
    I like the idea of the ConjunctFuture :-) I was wondering whether we can 
generalize it a little bit more by also collecting the actual values. Then the 
ConjunctFuture would effectively return a collection of the common base type of 
all registered futures. Here is a commit where I tried it out: 
https://github.com/tillrohrmann/flink/commit/f1f5ab63bfe75d629230e0fc2cf37d2499d85548.
 What do you think?


> Eager Scheduling should deploy all Tasks together
> -------------------------------------------------
>
>                 Key: FLINK-5747
>                 URL: https://issues.apache.org/jira/browse/FLINK-5747
>             Project: Flink
>          Issue Type: Bug
>          Components: JobManager
>    Affects Versions: 1.2.0
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>             Fix For: 1.3.0
>
>
> Currently, eager scheduling immediately triggers the scheduling for all 
> vertices and their subtasks in topological order. 
> This has two problems:
>   - This works only, as long as resource acquisition is "synchronous". With 
> dynamic resource acquisition in FLIP-6, the resources are returned as Futures 
> which may complete out of order. This results in out-of-order (not in 
> topological order) scheduling of tasks which does not work for streaming.
>   - Deploying some tasks that depend on other tasks before it is clear that 
> the other tasks have resources as well leads to situations where many 
> deploy/recovery cycles happen before enough resources are available to get 
> the job running fully.
> For eager scheduling, we should allocate all resources in one chunk and then 
> deploy once we know that all are available.
> As a follow-up, the same should be done per pipelined component in lazy batch 
> scheduling as well. That way we get lazy scheduling across blocking 
> boundaries, and bulk (gang) scheduling in pipelined subgroups.
> This also does not apply for efforts of fine grained recovery, where 
> individual tasks request replacement resources.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (FLINK-5747) Eager Scheduling should deploy all Tasks together

Reply via email to