Hi, there, I am interested in the implementation details of hadoop mapred. In particular, does the reducer wait till a map task ends and then fetch the output (key-value pairs)? If so, is the very file produced by a mapper for the reducer sorted before reducer gets it? (which means that the reducer only needs to do merge sort when it gets all the intermediate files from different mappers).
Best, -Kevin
