Re: GridEngine module for Hadoop on Demand

Daniel Templeton Fri, 08 May 2009 11:38:17 -0700

Thanks for the reply. Grid Engine works roughly the same way with respect to parallel jobs, except that we call the tasks the master and the slaves. Grid Engine does not have a pbsdsh equivalent, but it would be a really trivial wrapper script to write for qrsh, which is pbsdsh minus the automatic use of the nodes file (called the pe_hostfile in Grid Engine).
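A minimal sketch of such a wrapper, in Python since HoD is written in Python. It assumes the usual pe_hostfile layout (host name in the first column, slot count in the second) and assumes tasks are launched with "qrsh -inherit <host> <command>". The function names and the "hodring" command line are hypothetical placeholders, not HoD's real invocation. The sketch only builds the command lines; running them is left to the caller.

```python
def parse_pe_hostfile(text):
    """Return (host, slots) pairs from the contents of a pe_hostfile.

    Assumes each line starts with "<host> <slots> ..."; lines that
    have fewer than two fields are skipped.
    """
    hosts = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue
        hosts.append((fields[0], int(fields[1])))
    return hosts


def build_qrsh_commands(hosts, command, one_per_host=True):
    """Build one qrsh argument list per task to start.

    With one_per_host=True a single task is started on each host;
    otherwise one task is started per slot granted on that host.
    """
    commands = []
    for host, slots in hosts:
        count = 1 if one_per_host else slots
        for _ in range(count):
            commands.append(["qrsh", "-inherit", host] + list(command))
    return commands


if __name__ == "__main__":
    # Hypothetical pe_hostfile contents, for illustration only.
    sample = "node1 2 all.q@node1 UNDEFINED\nnode2 1 all.q@node2 UNDEFINED\n"
    for argv in build_qrsh_commands(parse_pe_hostfile(sample), ["hodring"]):
        print(" ".join(argv))
```

Inside a real job the file contents would come from the path in the PE_HOSTFILE environment variable, and each argument list could be handed to subprocess.Popen.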

I assume from the 3-task minimum that the JobTracker gets a slot, the NameNode gets a slot, and there has to be at least one slot running a DataNode/TaskTracker. Correct? Should a single job be prevented from running more than one hodring on a single host?

How do I go about contributing this Grid Engine extension to the HoD source base?


Daniel

Hemanth Yamijala wrote:

Daniel Templeton wrote:
Hi,
I have a functioning module for Grid Engine for HoD, but some parts of it are currently hard-coded to my workstation. In cleaning up those elements, I need some advice. Hopefully this is the right forum.
So, in the hodlib/NodePools/torque.py file, there's a runWorkers() method. In that method, it makes a single call to pbsdsh to start the NameNode, DataNodes, JobTracker, and TaskTrackers. I know nada about Torque, so please tell me if I'm interpreting this correctly. It would appear that pbsdsh somehow reads out of the environment how many hodring processes it should start up and executes them remotely, and each hodring then figures out what service it should run.
Roughly right. In Torque, when a set of nodes is assigned to a job, the first node in that list is special (it's called the mother superior, or MS); the other nodes are called sisters. The job that's submitted to Torque is a HOD process called 'ringmaster'. The ringmaster starts on the MS and invokes runWorkers, which executes pbsdsh. AFAIK, pbsdsh reads the environment and gets a 'nodes' file that Torque writes out. This file contains all the sisters allocated for the job (including the MS). It executes the command passed to pbsdsh (another HOD process, called hodring) on all of these nodes. The hodring processes work with the ringmaster to decide which service to run. In a sense, the ringmaster coordinates which service to start where, and informs each hodring to start that service.
In Grid Engine, the rough equivalent of pbsdsh is qrsh. (I think.) With qrsh, the master assigns the HoD job a set of nodes, and I then have to step through that set of nodes and qrsh to each one to start the hodring services. As far as I can tell, the total number of hodring services I need to start is 1 for the NameNode + 1 for the JobTracker + n for the DataNodes + m for the TaskTrackers.
HOD has a facility to use an HDFS service that's started outside of HOD. In that mode, it does not start the NameNode or DataNodes. Also, the number of DataNodes always equals the number of TaskTrackers (if HDFS services are started with HOD).
The thing that I'm not grokking is how the hodrings know what services to start, and how I should be parceling them out across the nodes of the cluster.
This is decided by the ringmaster process. The logic is independent of the resource manager in use, so you need not worry about it when porting to a new resource manager.
Should I be making sure I have two hodrings per node, one for the DataNode and one for the TaskTracker?
No, a single hodring gets to start both daemons.
If I were to go start a dozen hodrings, one on each of a dozen machines, would they work out among themselves how many should be DataNodes and how many should be TaskTrackers? One more thing. If the above is on the mark, that means you're consuming a queue slot for each DataNode unless you use an external HDFS service. That seems like a waste of cluster resources, since slots tend to correspond more to compute resources than I/O. I have to wonder if it wouldn't be more efficient from a cluster perspective to have each hodring start a DataNode and a TaskTracker. It would slightly oversubscribe that job slot, but that may be better than grossly undersubscribing two.
Explained above.

Thanks
Hemanth
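
The slot arithmetic discussed in this thread can be sketched as follows. This is only an illustration: it takes from Hemanth's answers that one hodring starts both the DataNode and the TaskTracker on a worker and that an external HDFS means no NameNode or DataNodes are started. That the JobTracker and the NameNode each occupy a hodring of their own is Daniel's reading of the 3-task minimum and is not confirmed in the thread. The function name and parameters are hypothetical.

```python
def hodrings_needed(workers, external_hdfs=False):
    """Estimate how many hodrings (one slot each) a HoD job would need.

    workers: number of nodes running a TaskTracker (plus a DataNode
    when HDFS is started by HoD).
    Assumption, unconfirmed in the thread: each master service
    (JobTracker, NameNode) runs in its own hodring.
    """
    if workers < 1:
        raise ValueError("at least one worker is required")
    # JobTracker always; NameNode only when HoD starts HDFS itself.
    masters = 1 if external_hdfs else 2
    return masters + workers


if __name__ == "__main__":
    print(hodrings_needed(1))                        # smallest job
    print(hodrings_needed(12))                       # a dozen workers
    print(hodrings_needed(12, external_hdfs=True))   # HDFS run outside HoD
```

Under these assumptions the smallest job needs three hodrings, which matches the 3-task minimum mentioned in the reply.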
