Eelco Lempsink wrote:
> Inspired by http://www.mail-archive.com/[EMAIL PROTECTED]/msg02394.html
> I'm trying to run Hadoop on multiple CPUs, but without using HDFS.

To be clear: you need some sort of shared filesystem; if not HDFS, then NFS, S3, or something else. For example, the job client communicates with the jobtracker by copying files into the shared filesystem named by fs.default.name, and job inputs and outputs are assumed to live there as well.

So, if you're using NFS, then you'd set fs.default.name to something like "file:///mnt/shared/hadoop/". Note also that as your cluster grows, NFS will soon become a bottleneck. That's why HDFS is provided: there aren't other readily available shared filesystems that scale appropriately.

Doug
