My experience with NFS-mounted roots is that they can bombard your network with packets. Even a simple little script can generate enough traffic to slow down other services. Add spoolers and the like and you end up burning a lot of network bandwidth and memory. The advantage of a local install is that you cut network traffic drastically. Granted, if your applications are all embarrassingly parallel and don't do much disk I/O, then an NFS root works great. Many of the applications we use here would utterly destroy the network if run from an NFS-mounted root. The advantage of rebuilding is consistency, without the disadvantages of NFS roots.

On Nov 21, 2005, at 9:45 AM, Eric Thibodeau wrote:

On 21 November 2005 at 11:10, Robin H. Johnson wrote:
On Sun, Nov 20, 2005 at 08:51:13PM -0500, Stéphane Lacasse wrote:
[snip discussion about installing]

I've done a cluster system (128 nodes + 1 master) in a similar fashion
to what you are after.
1. PXE-boot install environment for performing installs of both the
master and all of the nodes.
PXE-boot even for the master? So where do the images reside, and how do you manage the slightly varying config items such as hostname? This approach still seems a little time-consuming, since all the nodes remain individual entities (not NFS roots pointing at a single maintained image). Granted, with the nodes all identical, emerge -K should in theory be a breeze... but that doesn't help with keeping all the config files consistent.

2. The install environment uses the Gentoo Installer, with the CLI
frontend I wrote for the GLI project, and performs complete installs of
nodes in under 20 minutes (depending on network traffic).
So switching a machine's purpose/profile requires a complete re-install of the node? You state 20 minutes for re-installing; is it a _real_ install or the dump of a "reference" root? (Pardon my ignorance of the CLI installer you are referring to... I'll read the http link you'll send me ;) )

By using GLI, reconfiguring the cluster is a simple matter of altering the install profiles, and changing a node's purpose (presently we have an MPI mode and a MOSIX mode) means wiping and reinstalling it. Some of the cluster users need assurances that none of their data remains on the cluster after they are done, hence the value of being able to reinstall easily.
[...]
Also, make use of your cluster tools to administer the cluster. OpenPBS
allows running a job on all nodes, so use it to emerge -K [package].
(Not -k, as binpkgs don't currently have any locking in $PKGDIR and can
get corrupted if two emerge processes try to create a binpkg at the
same time.)
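The OpenPBS suggestion above could be sketched as a batch job like the following. This is only a sketch: the job name, node count, and package atom are assumptions, and it relies on pbsdsh to fan the command out to the allocated nodes.

```shell
#!/bin/sh
#PBS -N emerge-binpkg
#PBS -l nodes=128        # one slot per node; adjust to your cluster size

# pbsdsh -u runs the command once per unique allocated node.
# emerge -K installs strictly from prebuilt binary packages and fails
# rather than compiling if no binpkg is available, so nodes never
# build anything themselves.
pbsdsh -u /usr/bin/emerge -K app-misc/screen
```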

Actually, I would have thought you'd use _one_ node to compile the packages (with distcc, going by your description) and _then_ propagate the packages onto the other nodes with -K. Still, I would think maintaining an NFS-mounted root
would be less cumbersome...
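The build-once-then-propagate idea could look something like this sketch. The host names and package atom are hypothetical, and it assumes the default $PKGDIR of /usr/portage/packages; an NFS-exported $PKGDIR would make the rsync step unnecessary.

```shell
#!/bin/sh
# On the master: compile once and produce a binary package.
# -B (--buildpkgonly) builds the binpkg without installing it here.
emerge -B app-misc/screen

for n in node001 node002 node003; do      # hypothetical node names
    # Push the binpkg tree to the node.
    rsync -a /usr/portage/packages/ "$n":/usr/portage/packages/
    # Install strictly from the prebuilt package; -K never compiles.
    ssh "$n" emerge -K app-misc/screen
done
```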

--
Eric Thibodeau

--
[email protected] mailing list


