[ https://issues.apache.org/jira/browse/TEZ-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ming Ma updated TEZ-3270:
-------------------------
    Summary: Data locality based scheduling policy in fair routing  (was: Scheduling policy in fair routing)

> Data locality based scheduling policy in fair routing
> -----------------------------------------------------
>
>                 Key: TEZ-3270
>                 URL: https://issues.apache.org/jira/browse/TEZ-3270
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Ming Ma
>
> One of the scheduling factors is data locality. For a given completed task A, 
> it is better to run its dependent tasks in B on the same host/container 
> to reduce the network data transfer between the two. In addition, it might be 
> better to pick a task with a larger partition over one with a smaller 
> partition. For example, in the above fair routing diagram, after task A1 has 
> completed, task B1 and/or task B2 can be scheduled on the same host/container 
> as task A1, and B2 has higher priority than B1 given that P2 is larger than P1.
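
The ordering described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not Tez code: the class LocalityAwareOrdering, the PendingTask type, its fields, and the byte sizes are all hypothetical. It ranks pending downstream tasks so that those whose input sits on the host of the just-completed upstream task come first, and among those, larger partitions come before smaller ones.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch; none of these names exist in Apache Tez.
final class LocalityAwareOrdering {

    /** A downstream task waiting to be scheduled (illustrative only). */
    static final class PendingTask {
        final String name;
        final String inputHost;     // host where its input partition was produced
        final long partitionBytes;  // size of the partition it will consume

        PendingTask(String name, String inputHost, long partitionBytes) {
            this.name = name;
            this.inputHost = inputHost;
            this.partitionBytes = partitionBytes;
        }
    }

    /**
     * Returns a copy of the pending tasks ordered for scheduling on the host
     * of a just-completed upstream task: local tasks first, then by
     * partition size, largest first.
     */
    static List<PendingTask> order(List<PendingTask> pending, String completedHost) {
        List<PendingTask> sorted = new ArrayList<>(pending);
        sorted.sort(
            // Boolean.FALSE sorts before Boolean.TRUE, so local tasks lead.
            Comparator.comparing((PendingTask t) -> !t.inputHost.equals(completedHost))
                .thenComparing(
                    Comparator.comparingLong((PendingTask t) -> t.partitionBytes).reversed()));
        return sorted;
    }

    public static void main(String[] args) {
        // A1 ran on hostA; B1 reads P1 and B2 reads P2, with P2 larger than P1.
        List<PendingTask> pending = Arrays.asList(
            new PendingTask("B1", "hostA", 100L),
            new PendingTask("B2", "hostA", 300L),
            new PendingTask("B3", "hostB", 500L));
        for (PendingTask t : order(pending, "hostA")) {
            System.out.println(t.name);
        }
    }
}
```

With the made-up sizes in main, B2 is ranked ahead of B1 because P2 is larger than P1, and B3 comes last because its input is not on hostA even though its partition is the largest. A real policy would likely weigh locality against other scheduling factors rather than treat it as a strict first key.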



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
