Hi all,

I have recently started using Hadoop streaming. From the documentation, I
understand that by default, each line output from a mapper up to the first
tab becomes the key and rest of the line is the value. I wanted to know that
between the mapper and reducer, is there a shuffling(sorting) phase? More
specifically, Would it be correct to assume that output from all mappers
with the same key will go to the same reducer?

Thanks,
Nipun

Reply via email to