Re: Reduce function

Brad Tofel Mon, 18 Oct 2010 17:58:38 -0700

Whoops, just re-read your message, and see you may be asking abouttargeting a reduce callback function, not a reduce task..

If that's the case, I'm not sure I understand what your "bit/tag" isfor, and what you're trying to do with it. Can you provide a concreteexample (not necessarily code) of some keys which need to group together?

Is there a way to embed the "bit" within the value, so keys are alwayscommon?

If you really need to fake out the system so different keys arrive inthe same reduce, you might be able to do it with a combination of:


org.apache.hadoop.mapreduce.Job

.setSortComparatorClass()
.setGroupingComparatorClass()
.setPartitionerClass()

Brad

On 10/18/2010 05:41 PM, Brad Tofel wrote:

The "Partitioner" implementation used with your job should definewhich reduce target receives a given map output key.
I don't know if an existing Partitioner implementation exists whichmeets your needs, but it's not a very complex interface to develop, ifnothing existing works for you.
Brad

On 10/18/2010 04:43 PM, Shi Yu wrote:
How many tags you have? If you have several number of tags, you'dbetter create a Vector class to hold those tags. And define sumfunction to increment the values of tags. Then the value class shouldbe your new Vector class. That's better and more decent than theTextpair approach.
Shi

On 2010-10-18 5:19, Matthew John wrote:
Hi all,
I had a small doubt regarding the reduce module. What I understandis thatafter the shuffle / sort phase , all the records with the same keyvaluegoes into a reduce function. If thats the case, what is theattribute of the
Writable key which ensures that all the keys go to the same reduce ?
I am working on a reduce side Join where I need to tag all the keyswith a
bit which might vary but still want all those records to go into same
reduce. In Hadoop the Definitive Guide, pg. 235 they are usingTextPair forthe key. But I dont understand how the keys with different taginformation
goes into the same reduce.

Matthew

Re: Reduce function

Reply via email to