Re: Do people put their master node in the slave list - 0.15.1

Jason Venner Wed, 26 Dec 2007 12:11:53 -0800

This seems to be more a function of input file size than anything else. I had a single (uncompressed) 35 gig input file, Text,Value.


Jason Venner wrote:

I have been experimenting with that, and when I do, the master saturates well before the slave nodes, and the jobs start experiencing timeouts.

The map task in question is the IdentityMapper. This job is a simple merge sort, combining data by key where there are duplicate keys in the input stream.

There is no swapping going on in my cluster, the machines in question are all 8-processor boxes, and tasks.maximum was set to 6.
task_200712261033_0002_m_000078_0: Exception in thread "main" java.net.SocketTimeoutException: timed out waiting for rpc response
task_200712261033_0002_m_000078_0:     at org.apache.hadoop.ipc.Client.call(Client.java:484)
task_200712261033_0002_m_000078_0:     at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:184)
task_200712261033_0002_m_000078_0:     at org.apache.hadoop.mapred.$Proxy0.getTask(Unknown Source)
task_200712261033_0002_m_000078_0:     at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:1747)
07/12/26 10:48:03 INFO mapred.JobClient: Task Id : task_200712261033_0002_m_000081_1, Status : FAILED
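
For anyone unfamiliar with the job shape described above (an identity map followed by a merge of duplicate keys), here is a rough plain-Python sketch of the data flow. It is only an illustration and does not use the Hadoop API; the names identity_map and merge_by_key and the sample records are hypothetical, and a real job would stream the data rather than sort it in memory.

```python
from itertools import groupby
from operator import itemgetter


def identity_map(records):
    """Pass every (key, value) pair through unchanged, as an identity mapper would."""
    for key, value in records:
        yield key, value


def merge_by_key(records):
    """Sort the mapped pairs by key and collect the values of duplicate keys together."""
    ordered = sorted(identity_map(records), key=itemgetter(0))
    return [
        (key, [value for _, value in group])
        for key, group in groupby(ordered, key=itemgetter(0))
    ]


# Hypothetical sample input with one duplicated key.
sample = [("b", "2"), ("a", "1"), ("b", "3")]
merged = merge_by_key(sample)
```

Because the sort is stable, values that share a key keep their input order within the merged list.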
