increasing number of mappers.

Radim Kolar Wed, 09 Nov 2011 03:12:23 -0800

I have 2 input seq files 32MB each. I want to run them on as manymappers as possible.

i appended -D mapred.max.split.size=1000000 as command line argument tojob, but there is no difference. Job still runs on 2 mappers.


How split size works? Is max split size used for reading or writing files?

it works like this?: set maxsplitsize, write files and you will getbunch of seq files as output. then you will get same number of mappersas input files.

increasing number of mappers.

Reply via email to