[ 
https://issues.apache.org/jira/browse/HADOOP-1339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12574190#action_12574190
 ] 

Amar Kamat commented on HADOOP-1339:
------------------------------------

Does it makes sense to spawn a thread from the task tracker rather than a 
separate jvm? The reason being that the shuffle code is again a framework code. 

> Shuffle should be refactored to a separate task by itself
> ---------------------------------------------------------
>
>                 Key: HADOOP-1339
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1339
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: mapred
>            Reporter: Devaraj Das
>
> Currently, shuffle phase is part of the reduce task. The idea here is to move 
> out the shuffle as a first-class task. This will improve the usage of the 
> network since we will then be able to schedule shuffle tasks independently, 
> and later on pin reduce tasks to those nodes. This will make most sense for 
> apps where there are multiple waves of reduces (the second wave of reduces 
> can directly start off doing the "reducer" phase).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to