To add new nodes, just start the datanode on each new machine and point
it at the namenode (and likewise the tasktracker at the jobtracker). The
new nodes will join the cluster without a restart of the master, and any
jobs that are currently running can continue.
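
As a rough sketch of what that looks like on a new node: this assumes a
Hadoop install whose conf/hadoop-site.xml already names the masters (the
fs.default.name and mapred.job.tracker properties) and that ships
bin/hadoop-daemon.sh. The /opt/hadoop path and the function name are only
placeholders for illustration.

```shell
# Run on the NEW node. HADOOP_HOME is a hypothetical install path; adjust it.
HADOOP_HOME="${HADOOP_HOME:-/opt/hadoop}"

# Start the two worker daemons. Each one reads the master addresses from
# conf/hadoop-site.xml (assumed to be configured already) and registers itself.
start_worker_daemons() {
  for daemon in datanode tasktracker; do
    "$HADOOP_HOME/bin/hadoop-daemon.sh" start "$daemon"
  done
}
```

Calling start_worker_daemons on the new machine is then enough for it to
show up; nothing has to be run on the master for this step.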
If you want to be able to start and stop them from a single machine
(it doesn't necessarily need to be the namenode), you will need to do
both of the following: set up ssh keys from that machine to all of the
slaves, and add the slave machines to the slaves file.
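
A minimal sketch of those two steps, run on the control machine. It
assumes the slaves file lives at conf/slaves under the install directory,
that a key pair already exists (for example from ssh-keygen -t rsa), and
that ssh-copy-id is available. The path, host names, and function names
are made up for illustration.

```shell
# Run on the control machine. HADOOP_HOME is a hypothetical install path.
HADOOP_HOME="${HADOOP_HOME:-/opt/hadoop}"
SLAVES_FILE="${SLAVES_FILE:-$HADOOP_HOME/conf/slaves}"

# Append a host to the slaves file unless it is already listed.
add_slave() {
  grep -qxF "$1" "$SLAVES_FILE" 2>/dev/null || echo "$1" >> "$SLAVES_FILE"
}

# Copy this machine's public key to one slave so ssh needs no password.
push_key() {
  ssh-copy-id "$1"
}

# Example with two hypothetical new nodes:
# for host in node16.example.com node17.example.com; do
#   push_key "$host" && add_slave "$host"
# done
```

As far as I know, the start and stop scripts read the slaves file each
time they are invoked, so a newly added host should be picked up the next
time you run them.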
Dennis Kubes
Phantom wrote:
Hi
I had a question about ways of setting up large clusters. I did read the
wiki, which has a posting on this matter, and I have also been through the
exercise of setting up a cluster of 15 nodes. If I were to scale that out
to 100 nodes, do I need to manually add the new nodes to the slaves file
and bounce the master server? Or can I just start the task tracker on the
new nodes, pointing them to the master? If I have to bounce the master
every time I scale out my cluster, what happens to the jobs that are
currently running? Could someone please enlighten me regarding this?
Thanks in advance
Avinash