Hi, I'm starting a project to build a 10-node cluster grid.
I've already successfully built a 10-node grid with Hadoop 1.0.4. For this next grid I would prefer the 0.23.x branch, which I think would be the best version for a smooth transition to the 2.0.3 release (right?).

When I was working with the 1.0.4 proof of concept, I was scratching my head about the role of the 'clients' that submit jobs to the cluster. All the work I did then, such as `hadoop fs -put`, I ran directly from the namenode instance.

So the questions: How do I set up a grid where clients can submit jobs to the cluster in a queued fashion, and how do I set up the 'clients' so that they are properly recognized by the grid and able to submit jobs? Am I correct in thinking that a 'client' could be any machine (for example my laptop, on a network that can reach the namenode) that has access to the cluster and has Hadoop installed locally?

Thanks in advance.