Hadoop sorting algorithm on equal keys

Teodor Macicas Tue, 24 Aug 2010 02:22:16 -0700

Hello,

Let's say that we have two maps outputs which will be sorted before thereducer will start. Doesn't matter what {a,b0,b1,c} mean, but let'sassume that b0=b1.

Map output1 : a, b0
Map output2:  c, b1
In this case we can have 2 different sets of sorted data:
1. {a,b0,b1,c}  and
2. {a,b1,b0,c}  since b0=b1 .

In my particular problem I want to distingush between b0 and b1.Basically, they are numbers but I have extra-info on which my comparisonwill be made.Now, the question is: how can I change Hadoop default behaviour in orderto control the sorting algorithm on equal keys ?


Thank you in advance.
Best,
Teodor

Hadoop sorting algorithm on equal keys

Reply via email to