At 9:41 am -0700 4/16/07, Doug Cutting wrote:
Eelco Lempsink wrote:
Inspired by http://www.mail-archive.com/[EMAIL PROTECTED]/msg02394.html, I'm trying to run Hadoop on multiple CPUs, but without using HDFS.

To be clear: you need some sort of shared filesystem, if not HDFS, then NFS, S3, or something else. For example, the job client interacts with the job tracker by copying files to the shared filesystem named by fs.default.name, and job inputs and outputs are assumed to come from a shared filesystem.
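
To illustrate, here is a rough sketch of a job driver using the classic org.apache.hadoop.mapred API (class and method names are as in later Hadoop releases and may differ in the version discussed here; the "input" and "output" paths are made up for the example). The point is that those paths are resolved against the default filesystem named by fs.default.name, so the same data has to be visible there from every node:

    // Rough sketch, classic org.apache.hadoop.mapred API (names from later
    // releases; exact calls may differ in the version discussed in this thread).
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapred.FileInputFormat;
    import org.apache.hadoop.mapred.FileOutputFormat;
    import org.apache.hadoop.mapred.JobClient;
    import org.apache.hadoop.mapred.JobConf;

    public class SharedFsJob {
      public static void main(String[] args) throws Exception {
        JobConf conf = new JobConf(SharedFsJob.class);
        conf.setJobName("shared-fs-example");

        // These paths resolve against fs.default.name, so "input" and "output"
        // must live on a filesystem every node can see (HDFS, NFS, S3, ...).
        FileInputFormat.setInputPaths(conf, new Path("input"));
        FileOutputFormat.setOutputPath(conf, new Path("output"));

        // With no mapper/reducer set this runs an identity job; it only works
        // if the job tracker and all task trackers share the filesystem above.
        JobClient.runJob(conf);
      }
    }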

So, if you're using NFS, then you'd set fs.default.name to something like "file:///mnt/shared/hadoop/". Note also that as your cluster grows, NFS will soon become a bottleneck. That's why HDFS is provided: there aren't other readily available shared filesystems that scale appropriately.
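
For example, the hadoop-site.xml on every node might contain something like the following (a minimal sketch; it assumes each machine mounts the same NFS export at /mnt/shared/hadoop, the path used above):

    <!-- hadoop-site.xml: minimal sketch for an NFS-backed setup.
         Assumes every node mounts the same export at /mnt/shared/hadoop. -->
    <configuration>
      <property>
        <name>fs.default.name</name>
        <value>file:///mnt/shared/hadoop/</value>
      </property>
    </configuration>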

Has anybody been using Hadoop with ZFS? Would ZFS count as a readily available shared filesystem that scales appropriately?

Thanks,

-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"Find Code, Find Answers"
