Daniel Templeton wrote:
Thanks for the reply. Grid Engine works roughly the same way with
respect to parallel jobs, except that we call the tasks the master and
the slaves. Grid Engine does not have a pbsdsh equivalent, but it would
be a really trivial wrapper script to write around qrsh, which is pbsdsh
minus the automatic use of the nodes file (called the pe_hostfile in
Grid Engine).
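Something like the following, roughly (untested; it assumes the wrapper
runs inside a tightly integrated parallel job, so qrsh -inherit can
reach the slave hosts, and that the hostname is the first column of
each $PE_HOSTFILE line):

    # Sketch of a pbsdsh-like wrapper for Grid Engine: run the given
    # command once on every host listed in the pe_hostfile.
    import os
    import subprocess
    import sys

    def sge_dsh(cmd):
        procs = []
        with open(os.environ['PE_HOSTFILE']) as hostfile:
            for line in hostfile:
                host = line.split()[0]
                # qrsh -inherit runs the command on an already-allocated
                # host of this job, without going back to the scheduler.
                procs.append(subprocess.Popen(
                    ['qrsh', '-inherit', host] + cmd))
        return max(p.wait() for p in procs)

    if __name__ == '__main__':
        sys.exit(sge_dsh(sys.argv[1:]))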
I assume from the 3-task minimum that the JobTracker gets a slot, the
NameNode gets a slot, and there has to be at least one slot running a
DataNode/TaskTracker. Correct?
Yes
Should a single job be prevented from running more than one hodring on
a single host?
More than one hodring can be launched on a single host. However, this
means more than one instance of each slave would get launched - e.g.
two TaskTrackers and two DataNodes. In practice, we've seen that while
this also works, when we start running M/R tasks on such a system, it
slows the system down quite a bit. Hence I don't think this is really
useful.
How do I go about contributing this Grid Engine extension to the HoD
source base?
Please feel free to submit a patch if you've figured out all the
details. It should be against the current code base. See
http://wiki.apache.org/hadoop/HowToContribute for guidelines on
contributing.
Thanks!
Hemanth
Hemanth Yamijala wrote:
Daniel Templeton wrote:
Hi,
I have a functioning module for Grid Engine for HoD, but some parts
of it are currently hard-coded to my workstation. In cleaning up
those elements, I need some advice. Hopefully this is the right forum.
So, in the hodlib/NodePools/torque.py file, there's a runWorkers()
method. That method makes a single call to pbsdsh to start the
NameNode, DataNodes, JobTracker, and TaskTrackers. I know nada
about Torque, so please tell me if I'm interpreting this correctly.
It would appear that pbsdsh somehow reads out of the environment
how many hodring processes it should start up and executes them
remotely, and each hodring then figures out what service it should run.
Roughly right. In Torque, when a set of nodes is assigned to a job,
the first node in that list is special (it's called the mother superior
- MS); the other nodes are called sisters. The job that's submitted to
Torque is a HOD process called 'ringmaster'. The ringmaster starts on
the MS and invokes runWorkers, which executes pbsdsh. AFAIK, pbsdsh
reads the environment to find a 'nodes' file that Torque writes out.
This file contains all the sisters allocated for the job (including
the MS). pbsdsh executes the command passed to it - another HOD
process, called hodring - on all of these nodes. The hodring
processes work with the ringmaster to decide which service to run.
In a sense, the ringmaster coordinates which service to start where,
and informs each hodring to start that service.
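In pseudo-Python, the Torque side of runWorkers boils down to something
like this (a paraphrase, not the actual HOD code; the hodring arguments
are whatever options the ringmaster wants to pass along):

    # Paraphrase of the Torque flow: ringmaster, running on the mother
    # superior, fans the hodring command out to every node of the job.
    # pbsdsh finds the allocated nodes from the job's environment by
    # itself, so no host list needs to be passed explicitly.
    import subprocess

    def run_workers(hodring_cmd):
        # hodring_cmd is the argv for the hodring process, i.e. the
        # hodring script plus its config options (illustrative).
        return subprocess.call(['pbsdsh'] + hodring_cmd)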
In Grid Engine, the rough equivalent of pbsdsh is qrsh. (I think.)
With qrsh, the master assigns the HoD job a set of nodes, and I then
have to step through that set of nodes and qrsh to each one to start
the hodring services. As far as I can tell, the total number of
hodring services I need to start is 1 for the NameNode + 1 for the
JobTracker + n for the DataNodes + m for the TaskTrackers.
HOD has a facility to use an HDFS service that's started outside of
HOD. In that mode, it does not start a NameNode or DataNodes. Also, the
number of DataNodes always equals the number of TaskTrackers (if HDFS
services are started with HOD).
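For reference, pointing HOD at an external HDFS is done in the hodrc
config, roughly like this (the host and ports below are placeholders -
check the HOD documentation for the exact option names):

    [gridservice-hdfs]
    external  = true
    host      = namenode.example.com
    fs_port   = 8020
    info_port = 50070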
The thing that I'm not grokking is how the hodrings know what
services to start, and how I should be parceling them out across the
nodes of the cluster.
This is decided by the ringmaster process. The logic is independent
of the resource manager in use, and hence is not something you need
to worry about when porting to a new resource manager.
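Conceptually, each hodring just phones home and asks the ringmaster
what to run; the resource manager's only responsibility is to get the
hodring process started on each node. A hypothetical sketch of that
handshake (the names here are made up for illustration - see the actual
code for the real interface):

    # Hypothetical sketch of the hodring side of the handshake. The
    # ringmaster, not the resource manager, decides which daemons this
    # node runs. getCommand is an illustrative name, not the real API.
    import xmlrpclib  # Python 2, which is what HOD is written in

    def fetch_services(ringmaster_addr, hostname):
        proxy = xmlrpclib.ServerProxy('http://%s/' % ringmaster_addr)
        # e.g. returns ['datanode', 'tasktracker'] for a slave node
        return proxy.getCommand(hostname)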
Should I be making sure I have two hodrings per node, one for the
DataNode and one for the TaskTracker?
No, a single hodring gets to start both daemons.
If I were to go start a dozen hodrings, one on each of a dozen
machines, would they work out among themselves how many should be
DataNodes and how many should be TaskTrackers?

One more thing. If the above is on the mark, that means you're
consuming a queue slot for each DataNode unless you use an external
HDFS service. That seems like a waste of cluster resources, since
slots tend to correspond more to compute resources than to I/O. I have
to wonder if it wouldn't be more efficient from a cluster perspective
to have each hodring start both a DataNode and a TaskTracker. It would
slightly oversubscribe that job slot, but that may be better than
grossly undersubscribing two.
Explained above.
Thanks
Hemanth