[
https://issues.apache.org/jira/browse/FLINK-13244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17328391#comment-17328391
]
Flink Jira Bot commented on FLINK-13244:
----------------------------------------
This major issue is unassigned and itself and all of its Sub-Tasks have not
been updated for 30 days. So, it has been labeled "stale-major". If this ticket
is indeed "major", please either assign yourself or give an update. Afterwards,
please remove the label. In 7 days the issue will be deprioritized.
> Delayed Scheduler in Flink
> --------------------------
>
> Key: FLINK-13244
> URL: https://issues.apache.org/jira/browse/FLINK-13244
> Project: Flink
> Issue Type: Improvement
> Components: flink-contrib
> Reporter: Mridul Verma
> Priority: Major
> Labels: stale-major
>
> Currently with the flink scheduler and local splits
> * Suppose i use LocalSplitter to create splits for per hostname
> * Now a host requests for a local splits, there are chances that this node
> will get a local split but in case of no local split present , this node
> might get remote split and hence the data locality is of no use in these
> cases because there might be just some other node just round the corner who
> was just going to ask for its own local split but because this node asked for
> the split first , both the hosts/node will be given remote splits and hence
> the overall throughput of the system might decrease.
> * Proposal is to use Delayed scheduling. This has shown to be quite
> effective against these cases and might help us increase the overall
> throughput given the latency difference between execution of local and remote
> split is significant.
> * [https://cs.stanford.edu/~matei/papers/2010/eurosys_delay_scheduling.pdf]
> *
--
This message was sent by Atlassian Jira
(v8.3.4#803005)