Hi, I was wondering if it is possible in hadoop to assign the same partition numbers to the map outputs. I am running a map-only job (with zero reducers) and hadoop shuffles the partitions in the output: i.e. input/part-m-0000X is processed by task number Y and hence generates output/part-m-0000Y (where X != Y).
Thanks Maysam
