Hi,

I was wondering if it is possible in hadoop to assign the same partition
numbers to the map outputs. I am running a map-only job (with zero
reducers) and hadoop shuffles the partitions in the output: i.e.
input/part-m-0000X is processed by task number Y and hence generates
output/part-m-0000Y (where X != Y).

Thanks

Maysam

Reply via email to