[ 
https://issues.apache.org/jira/browse/FLINK-19343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17241382#comment-17241382
 ] 

Xuannan Su commented on FLINK-19343:
------------------------------------

Hi, Jin Xing. Thanks for your comments.
1. I agree with you that it is indeed odd for "Interactive" to rely on the 
clusterPartitionTracker to manage the lifecycle of cluster partitions, which 
are also a form of shuffle data. To support caching across jobs, we need a 
component whose lifecycle outlives the job to manage the shuffle data. 
2. TBH, I don't like the idea of passing the ShuffleDescriptor back to the 
client side either. But at that time, as you say, we did not have a separate 
component in the runtime to manage the lifecycle of shuffle data across jobs. 
Therefore, the decision was made to do so in order to support sharing shuffle 
data across jobs. To support caching in DataStream, it is a cleaner design to 
have a runtime-scoped component manage the shuffle data across jobs. 
3. I also don't think remote shuffle data should be managed by the 
PartitionTracker. Instead, I think a ClusterPartition is just one kind of 
shuffle data and should therefore be managed by the ShuffleService.
I am pulling in [~chesnay]. He may have more insight from the ClusterPartition 
perspective. 
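
To make the idea in points 1 and 3 a bit more concrete, here is a rough sketch 
of what a runtime-scoped component that outlives a job could look like. This is 
only an illustration: ClusterShuffleDataRegistry and all of its methods are 
hypothetical names, not existing Flink APIs, and the real design would need to 
be worked out with the ShuffleService.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical sketch, not an existing Flink class: a registry whose lifecycle
 * is tied to the cluster rather than to a single job, so that cached shuffle
 * data stays available after the producing job has finished.
 */
final class ClusterShuffleDataRegistry {

    // Maps a cached shuffle data id to the id of the job that produced it.
    private final Map<String, String> cachedData = new HashMap<>();

    /** Records shuffle data that should be kept beyond the producing job. */
    synchronized void register(String dataId, String producerJobId) {
        cachedData.put(dataId, producerJobId);
    }

    /**
     * Called when a job terminates. Deliberately a no-op in this sketch:
     * cached data must outlive the job that produced it.
     */
    synchronized void onJobFinished(String jobId) {
        // Nothing is released here; release has to be requested explicitly.
    }

    /** Returns true if a later job can still consume the cached data. */
    synchronized boolean isAvailable(String dataId) {
        return cachedData.containsKey(dataId);
    }

    /** Explicitly releases cached data; returns false if it was unknown. */
    synchronized boolean release(String dataId) {
        return cachedData.remove(dataId) != null;
    }
}
```

The point of the sketch is only that release is decoupled from job 
termination, so the client does not need to hold the ShuffleDescriptor itself.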

> FLIP-36: Support Interactive Programming in Flink
> -------------------------------------------------
>
>                 Key: FLINK-19343
>                 URL: https://issues.apache.org/jira/browse/FLINK-19343
>             Project: Flink
>          Issue Type: New Feature
>          Components: Table SQL / API
>            Reporter: Xuannan Su
>            Priority: Major
>
> Please refer to the FLIP for any details: 
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-36%3A+Support+Interactive+Programming+in+Flink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
