Ryan,

You can surely do this by having 4 virtual machines (I havent tried that
myself though). However, make sure you understand the principles behind
hadoop. The fundamental reason to go towards hadoop is to distribute the
load across multiple machines. Everything that hadoop provides is with that
basic idea in mind. Replication basically provides tolerance towards failing
hard drives. With 2 hard drives, a replication factor of >2 wont add value.
Secondly, HDFS is slower than the filesystem you would run it on... So,
throughputs would be low.

So, make sure you understand the design philosophy and then move forward on
it.

-ak



On Thu, Jul 30, 2009 at 6:15 PM, Ryan Smith <[email protected]>wrote:

> Sorry if this has been asked already, I didnt find anything on google.
>
> I want to run a dfs cluster on one machine.  8 core processor, 8 GB RAM, 3
> x
> 2TB drives.  I want triple replication on the dfs.
> If i make 4 IP aliases on the machine, (10.1.2.30,10.1.2.31, 10.1.2.32,
> 10.1.2.33) can i run 1 master (NN & SNN) and 3 slaves (3 DN)?
> Or what is the best approach?  I want to make a private hadoop storage for
> myself, but i dont want to buy a bunch of servers.
> Thanks, any advice is appreciated.
> -Ryan
>

Reply via email to