Brian R. Smith wrote:
> Hey list,
>
> 1. Proprietary parallel storage systems (like Panasas, etc.): It
> provides the per-node bandwidth, aggregate bandwidth, caching
> mechanisms, fault-tolerance, and redundancy that we require (plus
> having a vendor offering 24x7x365 support and 24-hour turnaround is
> quite a breath of fresh air for us). The price point is a little high
> for the amount of storage we will get, though: little more than double
> our current overall capacity. As far as I can tell, I can use this
> device as a permanent data store (like /home) and also as the user's
> scratch space, so that there is only a single point for all data needs
> across the cluster. It does, however, require the installation of
> vendor kernel modules, which add overhead to system administration
> (they need to be compiled, linked, and tested before every kernel
> update).
If you like Panasas, go with them.
The kernel module thing isn't all that big a deal - they are quite
willing to 'cook' the modules for you.
But YMMV.
> Our final problem is a relatively simple one, though I am definitely
> a newbie to the H.A. world. Under this consolidation plan, we will
> have only one point of entry to this cluster and hence a single point
> of failure. Have any beowulfers had experience with deploying
> clusters with redundant head nodes in a pseudo-H.A. fashion
> (heartbeat monitoring, fail-over, etc.), and what experiences have
> you had in adapting your resource manager to this task? Would it
> simply be more feasible to move the resource manager to another
> machine at this point (and have both head nodes act as submit and
> administrative clients)? My current plan is unfortunately light on
> the details of handling SGE in such an environment. It includes
> purchasing two identical 1U boxes (with good support contracts). They
> will monitor each other for availability, and the goal is to have the
> spare take over if the master fails. While the spare is not in use, I
> was planning on dispatching jobs to it.
I have constructed several clusters using HA.
I believe Joe Landman has also - as you are in the States, why not give
some thought to contacting Scalable and getting them to do some more
detailed designs for you?
For HA clusters, I have implemented several using Linux-HA and
heartbeat. This is an active/passive setup, with a primary and a backup
head node. On failover, the backup head node starts up the cluster
services.
Failing over SGE is (relatively) easy - the main part is making sure
that the cluster spool directory is on shared storage.
And mounting that shared storage on one machine or the other :-)
The harder part is failing over NFS - again, I've done it.
I gather there is a wrinkle or two with NFS v4 on Linux-HA type systems.
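To give you a flavour, with heartbeat's v1-style configuration it boils
down to two small files on each head node. This is only a sketch - the
host names, NIC, device, and mount point below are invented, and the
exact service script names depend on your distro and your SGE install:

  # /etc/ha.d/ha.cf (same on both head nodes)
  keepalive 2
  deadtime 30
  bcast eth1                # dedicated heartbeat NIC
  serial /dev/ttyS0         # second heartbeat path over a null-modem cable
  auto_failback on
  node head1
  node head2

  # /etc/ha.d/haresources (identical on both nodes)
  # Preferred owner, floating service IP, shared SGE spool filesystem,
  # then the init scripts heartbeat should start/stop on failover
  # (assumes the SGE startup script is installed as /etc/init.d/sgemaster;
  # the NFS script is nfs-kernel-server on Debian, nfs on Red Hat):
  head1 IPaddr::10.0.0.10/24/eth0 Filesystem::/dev/sdb1::/opt/sge/default/spool::ext3 sgemaster nfs-kernel-server

On the NFS side, the clients need to mount from the floating IP rather
than either node's own address, and putting /var/lib/nfs on the shared
disk keeps lock state across a failover - that covers most of the
"harder part" above.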
The second way to do this would be to look at using shared storage
and the Grid Engine qmaster failover mechanism. This is a different
approach, in that you have two machines running, using either a
NAS-type storage server or Panasas/Lustre. The SGE spool directory
lives on this shared storage, and the SGE qmaster will be started on
the second machine if the first fails to answer its heartbeat.
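That mechanism is the shadow master daemon, sge_shadowd. Roughly - with
an assumed cell name of "default" and made-up host names - the setup
looks like this:

  # $SGE_ROOT/default/common/shadow_masters
  # (primary qmaster host first, shadow host(s) after it)
  head1
  head2

  # On head2, run the shadow daemon; it watches the heartbeat file in
  # the shared qmaster spool and starts its own qmaster (rewriting
  # act_qmaster) if that file stops being updated:
  $SGE_ROOT/bin/<arch>/sge_shadowd      # <arch> e.g. lx24-amd64

Takeover timing is tunable through environment variables
(SGE_CHECK_INTERVAL, SGE_GET_ACTIVE_INTERVAL, SGE_DELAY_TIME) - see the
sge_shadowd man page. The one hard requirement is that the qmaster
spool and common directories sit on storage both machines can see,
which is exactly what the NAS/Panasas/Lustre box gives you.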
ps. 1U boxes? Think something a bit bigger - with hot-swap PSUs.
You also might have to fit a second network card for your HA heartbeat
link (links, plural - you need two) plus a SCSI card, so think slightly
bigger boxes for the two head nodes.
You can spec 1U nodes for interactive login/compile/job submission
duties. Maybe you could run a DNS round-robin type load balancer for
redundancy on these boxes - they should all be similar, and if one
stops working, then ho-hum.
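DNS round robin is just several A records for one name (the host names
and addresses below are invented):

  ; zone file fragment - login.cluster.example.com rotates across the boxes
  login   IN  A   192.168.1.21    ; login01
  login   IN  A   192.168.1.22    ; login02
  login   IN  A   192.168.1.23    ; login03

Users just ssh to "login"; if one box dies you drop its record (or park
its IP on a surviving node) and carry on.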
pps. "While the spare is not in use, I was planning on dispatching jobs
to it."
Actually, we also do a cold failover setup which is just like that, and
the backup node is used for running jobs while it is idle.
--
John Hearns
Senior HPC Engineer
Streamline Computing,
The Innovation Centre, Warwick Technology Park,
Gallows Hill, Warwick CV34 6UW
Office: 01926 623130 Mobile: 07841 231235