Re: Namenode Exceptions with S3

Doug Cutting Thu, 17 Jul 2008 10:17:10 -0700

Tom White wrote:

You can allow S3 as the default FS, it's just that then you can't run
HDFS at all in this case. You would only do this if you don't want to
use HDFS at all, for example, if you were running a MapReduce job
which read from S3 and wrote to S3.

Can't one work around this by using a different configuration on the client than on the namenodes and datanodes? The client should be able to set fs.default.name to an s3: uri, while the namenode and datanode must have it set to an hdfs: uri, no?
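A minimal sketch of that split, written in Python rather than as real Hadoop configuration files. The two dicts stand in for the client-side and daemon-side config; the bucket and host names are placeholders, and check_daemon_conf is a hypothetical helper, not part of Hadoop:

```python
from urllib.parse import urlparse

# Hypothetical per-role settings; bucket and host names are placeholders.
CLIENT_CONF = {"fs.default.name": "s3://example-bucket"}
DAEMON_CONF = {"fs.default.name": "hdfs://namenode.example.com:9000"}


def default_fs_scheme(conf):
    """Return the URI scheme of the default filesystem in a config dict."""
    return urlparse(conf["fs.default.name"]).scheme


def check_daemon_conf(conf):
    """Reject a config that a namenode or datanode could not start with."""
    scheme = default_fs_scheme(conf)
    if scheme != "hdfs":
        raise ValueError(f"HDFS daemons need an hdfs: URI, got {scheme!r}")
    return conf["fs.default.name"]
```

The client config passes through default_fs_scheme as "s3", while handing that same config to check_daemon_conf raises, which mirrors the kind of failure the daemons would hit if they shared the client's setting.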

Would it be useful to add command-line options to namenode and datanode that override the configuration, so that one could start non-default HDFS daemons?

It might be less confusing if the HDFS daemons didn't use
fs.default.name to define the namenode host and port. Just like
mapred.job.tracker defines the host and port for the jobtracker,
dfs.namenode.address (or similar) could define the namenode. Would
this be a good change to make?

Probably. For back-compatibility we could leave it empty by default, deferring to fs.default.name; only if folks specify a non-empty dfs.namenode.address would it be used.
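The fallback could look roughly like this. It is a Python sketch of the lookup order only, not Hadoop code; dfs.namenode.address is the name proposed in this thread (it may end up being something similar), and the value is assumed to be in host:port form:

```python
from urllib.parse import urlparse


def namenode_address(conf):
    """Resolve (host, port) for the namenode from a config dict.

    Uses dfs.namenode.address when it is set and non-empty; otherwise
    defers to the authority part of fs.default.name.
    """
    explicit = conf.get("dfs.namenode.address", "")
    if explicit:
        # Assumes "host:port"; a value without a port is not handled here.
        host, _, port = explicit.rpartition(":")
        return host, int(port)
    uri = urlparse(conf["fs.default.name"])
    if uri.scheme != "hdfs":
        raise ValueError("no namenode address: fs.default.name is not an hdfs: URI")
    return uri.hostname, uri.port
```

With this ordering an existing config that only sets fs.default.name to an hdfs: uri keeps working unchanged, and a config whose default FS is s3: can still name a namenode explicitly.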


Doug
