On a pseudo distributed mode, it actually just "move" the copy and not reproduce it :) Thanks anyways,
Maha On Mar 2, 2011, at 1:04 PM, maha wrote: > Thanks Mike :) > > I was also wondering what if: > > hdfs.CopyToLocal( src-file, dst-file) ; // is executed on node N > > and there exists a copy of src-file from the replication process in that same > node(N) local file system ? > > Will hdfs recognize that there is already a copy in there and hence just move > that copy to dst-file path ? > OR > Will hdfs go ahead with the copy and hence node N will have two copies of the > src-file? (ie. one on HDFS namespace and another in the local file system) > > > Thanks, > > Maha > > On Mar 2, 2011, at 12:38 PM, Michael Segel wrote: > >> >> >> Run is local to your edge machine where you launched your job. >> It then connects to the cluster / job tracker ... >> >> HTH >> >> -Mike >> >>> From: [email protected] >>> Subject: ToolRunner run function >>> Date: Wed, 2 Mar 2011 12:10:05 -0800 >>> To: [email protected] >>> >>> Hi, >>> >>> Assuming my program implements the ToolRunner, my question is where does >>> the "run" function execute? ie. which daemon (DataNode/TT) ? or is it on >>> the local machine where it is run? >>> >>> Thank you, >>> Maha >> >
