Yingjie Cao created FLINK-22910:
-----------------------------------

             Summary: ShuffleMaster enhancement for pluggable shuffle service 
framework
                 Key: FLINK-22910
                 URL: https://issues.apache.org/jira/browse/FLINK-22910
             Project: Flink
          Issue Type: Sub-task
          Components: Runtime / Coordination
            Reporter: Yingjie Cao
             Fix For: 1.14.0


The current _ShuffleMaster_ has an unclear lifecycle which is inconsistent with 
the _ShuffleEnvironment_ at the _TM_ side. Besides, it is hard to Implement 
some important capabilities for remote shuffle service. For example, 1) release 
external resources when a job finished; 2) Stop or start tracking some 
partitions depending on the status of the external service or system.

We drafted a document[1] which proposed some simple changes to solve these 
issues. The document is still not wholly completed yet. We will start a 
discussion once it is finished.

 

[1] 
https://docs.google.com/document/d/1_cHoapNbx_fJ7ZNraSqw4ZK1hMRiWWJDITuSZrdMDDs/edit?usp=sharing



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to