[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14065752#comment-14065752
 ] 

Chris Douglas commented on MAPREDUCE-5974:
------------------------------------------

Could this be equivalently implemented as a composite collector using the 
existing plugin architecture? Trading off collector implementations at runtime 
is a cool idea, but if the criteria are available to {{init}} then they're also 
available during submission (excluding availability of local dependencies or 
arch restrictions in heterogeneous clusters).

Changing strategies during the collection phase based on the records emitted 
seems to have equivalent or better potential, and is covered by the composite 
strategy, also.

> Allow map output collector fallback
> -----------------------------------
>
>                 Key: MAPREDUCE-5974
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5974
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>          Components: task
>    Affects Versions: 2.6.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>         Attachments: mapreduce-5974.txt
>
>
> Currently we only allow specifying a single MapOutputCollector implementation 
> class in a job. It would be nice to allow a comma-separated list of classes: 
> we should try each collector implementation in the user-specified order until 
> we find one that can be successfully instantiated and initted.
> This is useful for cases where a particular optimized collector 
> implementation cannot operate on all key/value types, or requires native 
> code. The cluster administrator can configure the cluster to try to use the 
> optimized collector and fall back to the default collector.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to