Dear Whirr Developers and Users

I'm using Whirr to run a hadoop cluster on AWS EC2, and I met a problem.
After launching a hadoop cluster, I logged into the master instance using
putty, get to the path /usr/local/hadoop/conf,
and checked the hdfs-site.xml. I found that the 'dfs.data.dir' was set to
'/data0/hadoop/hdfs/data'.

The /data0 folder is in the instance's /dev/sda1 device, which is very
small. If the hadoop store the HDFS data here, the space will not be enough
for my big data.
The EC2 instances have another device /dev/sdb, which is much bigger and
has been mounted to /mnt.
So I think the 'dfs.data.dir' should be set to a folder under /mnt (and
also some other properties such as 'hadoop.tmp.dir', 'mapred.local.dir'
etc.)
Could anyone tell me how to do this using Whirr?

Thank you very much!

Best regards,

Jingchen LIU

Reply via email to