Phantom wrote:
If my Map job is going to process a file, does it have to be in HDFS?
No, but it usually is. Job inputs are resolved relative to the default filesystem, so if you've configured the default filesystem to be HDFS and you pass an input filename that isn't qualified with a filesystem, then that input needs to be in HDFS.
But inputs don't have to be in the default filesystem, nor must they be in HDFS. They only need to be in a filesystem that's accessible to all nodes. They could be in NFS, S3, or Ceph instead of HDFS. They could even be in a non-default HDFS instance.
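
To make the path resolution concrete, here is a minimal sketch using the classic org.apache.hadoop.mapred API; the class name, bucket, host, and paths are all illustrative:

    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapred.FileInputFormat;
    import org.apache.hadoop.mapred.JobConf;

    public class InputPaths {
        public static void main(String[] args) {
            JobConf job = new JobConf();
            // Unqualified path: resolved against the default filesystem
            // (HDFS, if that's what fs.default.name points at).
            FileInputFormat.addInputPath(job, new Path("/data/input"));
            // Qualified paths: read from the named filesystem,
            // regardless of the default.
            FileInputFormat.addInputPath(job, new Path("s3://my-bucket/data/input"));
            FileInputFormat.addInputPath(job, new Path("hdfs://other-nn:9000/data/input"));
        }
    }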
And if so, how do I get it there?
If HDFS is configured as your default filesystem:

    bin/hadoop fs -put localFileName nameInHdfs

Doug
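
The same copy can also be done from Java with the FileSystem API; a minimal sketch, assuming HDFS is the default filesystem (the class name and both paths are illustrative):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class PutFile {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();  // loads the Hadoop configuration
            FileSystem fs = FileSystem.get(conf);      // the default filesystem
            // Equivalent of "bin/hadoop fs -put localFileName nameInHdfs".
            fs.copyFromLocalFile(new Path("localFileName"),
                                 new Path("nameInHdfs"));
        }
    }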
