If the file size is less than the block size, then the file isn't "spanning" across nodes. Files are split at block-size boundaries, so your file is essentially just one block here, stored whole on each node that holds a replica.
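
To make the splitting rule concrete, here is a minimal sketch of the arithmetic only. It does not talk to HDFS; the function name `block_ranges` and the 64 MB default are assumptions for illustration (64 MB being the default mentioned in the question below).

```python
MB = 1024 * 1024


def block_ranges(file_size, block_size=64 * MB):
    """Return (offset, length) pairs, one per block, for a file of
    file_size bytes cut at multiples of block_size.

    Hypothetical helper: it only models the arithmetic of splitting
    at block-size boundaries and does not query a cluster.
    """
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size > 0")
    ranges = []
    offset = 0
    while offset < file_size:
        # The last block may be shorter than block_size.
        length = min(block_size, file_size - offset)
        ranges.append((offset, length))
        offset += length
    return ranges


if __name__ == "__main__":
    # A 10 MB file fits in a single block.
    print(len(block_ranges(10 * MB)))
    # A 150 MB file needs three blocks: 64 MB, 64 MB, and 22 MB.
    print([length // MB for _, length in block_ranges(150 * MB)])
```

Any file smaller than the block size yields exactly one range, which is why a small file cannot be divided between two nodes.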
Also see http://search-hadoop.com/m/tGBgk1WFVAO1 for your block location question. You can get the node list of replicas this way, but not the explicit local paths.

On Wed, Feb 1, 2012 at 8:48 PM, Qiming He <[email protected]> wrote:
> Hi all,
>
> Is there anyway (command) to determine the physical location of a file in
> HDFS to see it spans over multiple nodes? and any way to force a small file
> not to span over two nodes? assuming its size is smaller than default block
> size (e.g., 64MB).
>
> Thanks in advance
>
> -Qiming

--
Harsh J
Customer Ops. Engineer
Cloudera | http://tiny.cloudera.com/about
