I want to use TotalOrderPartitioner to produce globally sorted results for reducers. As I know, this partitioner needs a partition file which is generated by input samplers. But it seems that all these samplers can only sample input data. Why doesn't samplers sample data from mappers' output? I think that would be more useful.
I'm new to Hadoop, please correct me if I'm wrong. Thanks in advance. -- View this message in context: http://old.nabble.com/how-to-use-InputSampler---TotalOrderPartitioner--tp27023687p27023687.html Sent from the Hadoop core-user mailing list archive at Nabble.com.
