[ https://issues.apache.org/jira/browse/HADOOP-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Allen Wittenauer resolved HADOOP-2960.
--------------------------------------
Resolution: Won't Fix
Closing as won't fix, given the -1.
> A mapper should use some heuristics to decide whether to run the combiner
> during spills
> ---------------------------------------------------------------------------------------
>
> Key: HADOOP-2960
> URL: https://issues.apache.org/jira/browse/HADOOP-2960
> Project: Hadoop Common
> Issue Type: Bug
> Reporter: Runping Qi
>
> Right now, the combiner, if set, will be called for each spill, no matter
> whether the combiner can actually reduce the values.
> The mapper should use some heuristics to decide whether to run the combiner
> during spills.
> One such heuristic is to check the ratio of the number of keys to
> the number of unique keys in the spill.
> The combiner will be called only if that ratio exceeds a certain threshold
> (say 2).
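
The heuristic proposed in the description could be sketched roughly as
follows. This is only an illustration, not code from Hadoop: the class
SpillCombinerHeuristic and the method shouldRunCombiner are hypothetical
names, and the sketch assumes the keys of a spill are available as an
already sorted list, so that equal keys sit next to each other.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper for illustration; not part of Hadoop's API. */
public class SpillCombinerHeuristic {

    /**
     * Returns true when the ratio of total keys to unique keys in a spill
     * exceeds the threshold. Assumes sortedKeys is sorted, so duplicates
     * are adjacent and unique keys can be counted in one pass.
     */
    public static boolean shouldRunCombiner(List<String> sortedKeys, double threshold) {
        if (sortedKeys.isEmpty()) {
            return false; // nothing to combine
        }
        long uniqueKeys = 1; // the first key always starts a new group
        for (int i = 1; i < sortedKeys.size(); i++) {
            if (!sortedKeys.get(i).equals(sortedKeys.get(i - 1))) {
                uniqueKeys++;
            }
        }
        double ratio = (double) sortedKeys.size() / uniqueKeys;
        return ratio > threshold;
    }

    public static void main(String[] args) {
        // 6 keys, 2 unique: ratio 3.0, above the suggested threshold of 2
        List<String> repetitive = Arrays.asList("a", "a", "a", "b", "b", "b");
        // 4 keys, 3 unique: ratio about 1.33, below the threshold
        List<String> mostlyUnique = Arrays.asList("a", "b", "c", "c");
        System.out.println(shouldRunCombiner(repetitive, 2.0));   // true
        System.out.println(shouldRunCombiner(mostlyUnique, 2.0)); // false
    }
}
```

With the threshold of 2 suggested above, the combiner would run only for
spills where each key appears more than twice on average.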
--
This message was sent by Atlassian JIRA
(v6.2#6252)