have more data for it to process :)
On 2011-05-26, at 4:30 PM, Mohit Anchlia wrote: > I ran a simple pig script on this file: > > -rw-r--r-- 1 root root 208348 May 26 13:43 excite-small.log > > that orders the contents by name. But it only created one mapper. How > can I change this to distribute accross multiple machines? > > On Thu, May 26, 2011 at 3:08 PM, jagaran das <[email protected]> wrote: >> Hi Mohit, >> >> No of Maps - It depends on what is the Total File Size / Block Size >> No of Reducers - You can specify. >> >> Regards, >> Jagaran >> >> >> >> ________________________________ >> From: Mohit Anchlia <[email protected]> >> To: [email protected] >> Sent: Thu, 26 May, 2011 2:48:20 PM >> Subject: No. of Map and reduce tasks >> >> How can I tell how the map and reduce tasks were spread accross the >> cluster? I looked at the jobtracker web page but can't find that info. >> >> Also, can I specify how many map or reduce tasks I want to be launched? >> >> From what I understand is that it's based on the number of input files >> passed to hadoop. So if I have 4 files there will be 4 Map taks that >> will be launced and reducer is dependent on the hashpartitioner. >>
