In PigMapReduce.run(RecordReader input, OutputCollector output, Reporter reporter), as I can see, Pig does create its own OutputCollector and write output to its own files (using PigWriter). How does the shuffle process work if the files aren't created from the outputCollector supplied in run(RecordReader input, OutputCollector output, Reporter reporter)? Do we just put the output files to the location where shuffle expects?
Thanks for explanation in advance, Pi
