Thanks Mike :) I was also wondering what if:

hdfs.CopyToLocal(src-file, dst-file);

is executed on node N, and a copy of src-file from the replication process already exists in the local file system of that same node (N)?

Will HDFS recognize that there is already a copy there and just move that copy to the dst-file path? Or will HDFS go ahead with the copy, so that node N ends up with two copies of src-file (i.e., one in the HDFS namespace and another in the local file system)?

Thanks,
Maha

On Mar 2, 2011, at 12:38 PM, Michael Segel wrote:

>
> Run is local to your edge machine where you launched your job.
> It then connects to the cluster / job tracker ...
>
> HTH
>
> -Mike
>
>> From: [email protected]
>> Subject: ToolRunner run function
>> Date: Wed, 2 Mar 2011 12:10:05 -0800
>> To: [email protected]
>>
>> Hi,
>>
>> Assuming my program implements the ToolRunner, my question is where does
>> the "run" function execute? i.e., which daemon (DataNode/TT)? Or is it on
>> the local machine where it is run?
>>
>> Thank you,
>> Maha
>
