What's the difference between setNumMapTasks and mapred.map.tasks? On Fri, Mar 9, 2012 at 5:00 PM, Chen He <airb...@gmail.com> wrote:
> Hi Mohit > > " mapred.tasktracker.reduce(map).tasks.maximum " means how many reduce(map) > slot(s) you can have on each tasktracker. > > "mapred.job.reduce(maps)" means default number of reduce (map) tasks your > job will has. > > To set the number of mappers in your application. You can write like this: > > *configuration.setNumMapTasks(the number you want);* > > Chen > > Actually, you can just use configuration.set() > > On Fri, Mar 9, 2012 at 6:42 PM, Mohit Anchlia <mohitanch...@gmail.com > >wrote: > > > What's the difference between mapred.tasktracker.reduce.tasks.maximum and > > mapred.map.tasks > > ** > > I want my data to be split against only 10 mappers in the entire > cluster. > > Can I do that using one of the above parameters? > > >