The SequenceFile sorter is ok. It used to be the sort used in the shuffle. *grin*
Make sure to set io.sort.factor and io.sort.mb to appropriate values for your hardware. I'd usually use io.sort.factor as 25 * drives and io.sort.mb is the amount of memory you can allocate to the sorting. -- Owen
