[ 
https://issues.apache.org/jira/browse/MAPREDUCE-7168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikayla Konst updated MAPREDUCE-7168:
-------------------------------------
    Attachment: MAPREDUCE-7168.patch
        Status: Patch Available  (was: Open)

> Add option to not kill already-done map tasks when node becomes unusable
> ------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-7168
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7168
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: mrv2
>    Affects Versions: 2.9.2
>         Environment: Google Compute Engine (Dataproc), Java 8
>            Reporter: Mikayla Konst
>            Priority: Minor
>         Attachments: MAPREDUCE-7168.patch
>
>
> When a node becomes unusable, if there are still reduce tasks running, all 
> completed map tasks that were run on that node are killed so that they can be 
> re-run on a different node. This is because the node can no longer serve 
> shuffle data, so the map task output cannot be fetched by the reducers.
> If map tasks do not write their shuffle data locally, killing already-done 
> map tasks will make the job lose map progress unnecessarily. This change 
> prevents map progress from being lost when shuffle data is not written 
> locally by providing a property mapreduce.map.rerun-if-node-unusable that can 
> be set to false to prevent killing already-done map tasks.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org

Reply via email to