Dear Whirr Developers and Users I'm using Whirr to run a hadoop cluster on AWS EC2, and I met a problem. After launching a hadoop cluster, I logged into the master instance using putty, get to the path /usr/local/hadoop/conf, and checked the hdfs-site.xml. I found that the 'dfs.data.dir' was set to '/data0/hadoop/hdfs/data'.
The /data0 folder is in the instance's /dev/sda1 device, which is very small. If the hadoop store the HDFS data here, the space will not be enough for my big data. The EC2 instances have another device /dev/sdb, which is much bigger and has been mounted to /mnt. So I think the 'dfs.data.dir' should be set to a folder under /mnt (and also some other properties such as 'hadoop.tmp.dir', 'mapred.local.dir' etc.) Could anyone tell me how to do this using Whirr? Thank you very much! Best regards, Jingchen LIU
