[ 
https://issues.apache.org/jira/browse/FLINK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15628523#comment-15628523
 ] 

ASF GitHub Bot commented on FLINK-4939:
---------------------------------------

Github user zentol commented on a diff in the pull request:

    https://github.com/apache/flink/pull/2707#discussion_r86107589
  
    --- Diff: 
flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/operators/GenericWriteAheadSink.java
 ---
    @@ -27,27 +26,30 @@
     import org.apache.flink.runtime.state.CheckpointStreamFactory;
     import org.apache.flink.runtime.state.StreamStateHandle;
     import 
org.apache.flink.runtime.util.ReusingMutableToRegularIteratorWrapper;
    -import org.apache.flink.streaming.api.operators.StreamCheckpointedOperator;
     import org.apache.flink.streaming.api.operators.AbstractStreamOperator;
     import org.apache.flink.streaming.api.operators.OneInputStreamOperator;
    +import org.apache.flink.streaming.api.operators.StreamCheckpointedOperator;
     import org.apache.flink.streaming.runtime.streamrecord.StreamRecord;
     import org.apache.flink.util.InstantiationUtil;
    +import org.apache.flink.util.Preconditions;
     import org.slf4j.Logger;
     import org.slf4j.LoggerFactory;
     
     import java.io.IOException;
     import java.io.Serializable;
     import java.util.HashSet;
    +import java.util.Map;
     import java.util.Set;
     import java.util.TreeMap;
     import java.util.UUID;
     
     /**
    - * Generic Sink that emits its input elements into an arbitrary backend. 
This sink is integrated with the checkpointing
    - * mechanism and can provide exactly-once guarantees; depending on the 
storage backend and sink/committer implementation.
    + * Generic Sink that emits its input elements into an arbitrary backend. 
This sink is integrated with
    + * Flink's checkpointing mechanism and can provide exactly-once 
guarantees; depending on the storage
    --- End diff --
    
    the javadocs don't exceed the line limit, please don't reformat them.


> GenericWriteAheadSink: Decouple the creating from the committing subtask for 
> a pending checkpoint
> -------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-4939
>                 URL: https://issues.apache.org/jira/browse/FLINK-4939
>             Project: Flink
>          Issue Type: Improvement
>          Components: Cassandra Connector
>            Reporter: Kostas Kloudas
>            Assignee: Kostas Kloudas
>             Fix For: 1.2.0
>
>
> So far the GenericWriteAheadSink expected that
> the subtask that wrote a pending checkpoint to the 
> state backend, will be also the one to commit it to
> the third-party storage system.
> This issue targets at removing this assumption. To do this 
> the CheckpointCommitter has to be able to dynamically
> take the subtaskIdx as a parameter when asking 
> if a checkpoint was committed and also change the
> state kept by the GenericWriteAheadSink to also 
> include that subtask index of the subtask that wrote 
> the pending checkpoint.
> This change is also necessary for making the operator rescalable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to