Thanks Todd.

I wasn't sure whether that was possible. But you raised an important point: it is just the NN and JT that would run remotely.

So in order to do this, would I just install a complete Hadoop instance on each one? And would they then be configured as masters?

Or should the NameNode and JobTracker run on the same machine, so that there would be one master?

So when I start the cluster, would I start it from the NN/JT machine? Could it also be started from any of the other cluster members?

Sorry for all of the seemingly basic questions, but I want to get it right the first time :)

-John



On Nov 9, 2009, at 1:11 PM, Todd Lipcon wrote:

On Mon, Nov 9, 2009 at 7:20 AM, John Martyniak <[email protected]> wrote:


Can the NameNode/DataNode & JobTracker/TaskTracker run on a server that isn't part of the "cluster"? By that I mean I would like to run them on a machine that wouldn't participate in the processing of data or in the HDFS data sharing, and would focus solely on the NameNode/DataNode & JobTracker/TaskTracker tasks.


Yes, running the NN and the JT on servers that don't also run TT/DN is very
common and recommended for clusters of more than maybe 5 nodes.

-Todd
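
As a rough illustration of the layout described above (NN and JT on dedicated hosts, DN/TT only on the workers), here is a minimal Python sketch that renders the configuration files such a cluster would typically share. It assumes the Hadoop 0.20-era property names fs.default.name and mapred.job.tracker; the thread does not state a version. The hostnames, the ports 8020/8021, and the helper names site_xml and cluster_conf are hypothetical and exist only for this example.

```python
from xml.sax.saxutils import escape


def site_xml(props):
    """Render a Hadoop-style *-site.xml document from a dict of properties."""
    lines = ['<?xml version="1.0"?>', "<configuration>"]
    for name, value in props.items():
        lines.append("  <property>")
        lines.append(f"    <name>{escape(name)}</name>")
        lines.append(f"    <value>{escape(value)}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines) + "\n"


def cluster_conf(namenode, jobtracker, workers, nn_port=8020, jt_port=8021):
    """Build the conf files for a cluster whose masters do no DN/TT work.

    Ports are illustrative assumptions, not values taken from the thread.
    """
    # The point of the layout: master hosts must not appear among the workers.
    overlap = {namenode, jobtracker} & set(workers)
    if overlap:
        raise ValueError(f"master host(s) also listed as workers: {sorted(overlap)}")
    return {
        # Every node reads these to locate the NameNode and the JobTracker.
        "core-site.xml": site_xml(
            {"fs.default.name": f"hdfs://{namenode}:{nn_port}"}
        ),
        "mapred-site.xml": site_xml(
            {"mapred.job.tracker": f"{jobtracker}:{jt_port}"}
        ),
        # conf/slaves lists only the hosts that run DataNode/TaskTracker.
        "slaves": "\n".join(workers) + "\n",
    }


if __name__ == "__main__":
    # Hypothetical hostnames: two dedicated masters and three workers.
    conf = cluster_conf("nn-host", "jt-host", ["worker1", "worker2", "worker3"])
    for filename, body in conf.items():
        print(f"--- conf/{filename} ---")
        print(body)
```

Passing the same hostname for namenode and jobtracker gives the single-master variant. In either case the same conf directory is usually copied to every node, and, as far as I know for releases of that era, start-dfs.sh is run on the NameNode host and start-mapred.sh on the JobTracker host.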
