subject:"Retrieve and compute input splits"

Re: Retrieve and compute input splits

2013-09-30 Thread Sai Sai

TextInputFormat, the default one. Any suggestions? Thanks, Sai From: Jay Vyas To: "common-u...@hadoop.apache.org" Cc: Sai Sai Sent: Saturday, 28 September 2013 5:35 AM Subject: Re: Retrieve and compute input splits Technically, the block locations are p

Re: Retrieve and compute input splits

2013-09-27 Thread Jay Vyas

Technically, the block locations are provided by the InputSplit, which in the FileInputFormat case is provided by the FileSystem interface. http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapred/InputSplit.html The thing to realize here is that the FileSystem implementation is provide

Re: Retrieve and compute input splits

2013-09-27 Thread Peyman Mohajerian

For the JobClient to compute the input splits, doesn't it need to contact the NameNode? Only the NameNode knows where the splits are, so how can it compute them without that additional call? On Fri, Sep 27, 2013 at 1:41 AM, Sonal Goyal wrote: > The input splits are not copied, only the information on the l

Re: Retrieve and compute input splits

2013-09-27 Thread Sonal Goyal

The input splits are not copied; only the information on the location of the splits is copied to the jobtracker, so that it can assign tasktrackers that are local to the split. Check the Job Initialization section at http://answers.oreilly.com/topic/459-anatomy-of-a-mapreduce-job-run-with-hadoop/
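
To make the point in this thread concrete, here is a rough, self-contained sketch of how a client might turn block metadata into split descriptions that carry only byte ranges and host names, never the file data itself. It is a simplified model, not Hadoop's code: the class SplitSketch, its nested Block and Split types, and the host names are all hypothetical, and the block list stands in for what the file system (the NameNode, in the HDFS case) would report. The split-size rule mirrors the max(minSize, min(maxSize, blockSize)) rule used by FileInputFormat, but the real implementation has further details (such as a slop factor for the last split) that are left out here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SplitSketch {

    /** Hypothetical stand-in for the block metadata reported by the file system. */
    static final class Block {
        final long offset;
        final long length;
        final String[] hosts;
        Block(long offset, long length, String... hosts) {
            this.offset = offset;
            this.length = length;
            this.hosts = hosts;
        }
    }

    /** Hypothetical split description: a byte range plus the hosts that hold it. */
    static final class Split {
        final long start;
        final long length;
        final String[] hosts;
        Split(long start, long length, String[] hosts) {
            this.start = start;
            this.length = length;
            this.hosts = hosts;
        }
    }

    /** Clamp the block size between the configured minimum and maximum split size. */
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    /** Return the hosts of the block containing the given byte offset. */
    static String[] hostsFor(List<Block> blocks, long offset) {
        for (Block b : blocks) {
            if (offset >= b.offset && offset < b.offset + b.length) {
                return b.hosts;
            }
        }
        return new String[0]; // offset not covered by any known block
    }

    /** Cut the file into ranges of at most splitSize bytes, tagging each with hosts. */
    static List<Split> computeSplits(long fileLength, long splitSize, List<Block> blocks) {
        List<Split> splits = new ArrayList<>();
        long remaining = fileLength;
        while (remaining > 0) {
            long start = fileLength - remaining;
            long length = Math.min(splitSize, remaining);
            splits.add(new Split(start, length, hostsFor(blocks, start)));
            remaining -= length;
        }
        return splits;
    }

    public static void main(String[] args) {
        // Assumed example: a 300-byte file stored in 128-byte blocks on made-up hosts.
        List<Block> blocks = Arrays.asList(
                new Block(0, 128, "hostA", "hostB"),
                new Block(128, 128, "hostB", "hostC"),
                new Block(256, 44, "hostC", "hostA"));
        long splitSize = computeSplitSize(128, 1, Long.MAX_VALUE);
        for (Split s : computeSplits(300, splitSize, blocks)) {
            System.out.println(s.start + "+" + s.length + " on " + Arrays.toString(s.hosts));
        }
    }
}
```

In this model the list of Split objects is the only thing handed on for scheduling, which matches the description above: the scheduler can prefer a worker on one of the listed hosts, and the task reads the byte range itself when it runs.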