[slurm-dev] Re: Using the same (mounted) slurm installation on all nodes

Jason Bacon Fri, 25 Jul 2014 06:18:33 -0700

Our CentOS cluster uses a shared installation for all the compute nodes,but separate local installations for the head node and backup headnode. The compute nodes share binaries and configuration files via NFS,but keep separate logs in their own local /var/log and the startupscript in their local init.d.

The head node and backup head node are independent of each other exceptfor shared state information. See "High Availability" in the SLURM docs:


    http://slurm.schedmd.com/quickstart_admin.html#Config

If NFS is properly configured, clients will wait indefinitely andcontinue where they left off, so an NFS server failure should not resultin loss of data as long as the server comes back online while the clientis still trying to complete its operations.

There are pros and cons to a separate server for the head node andbackup head node state information. With a separate server, both canoperate normally while the other is down. However, is the separateserver goes down, neither head node can operate normally until it comesback up. A single server failure is more likely with 3 servers than with 2.

If state information is kept on the primary head node, the backup headnode will be blocked from updating state information while the primaryis down, and vice versa. This shouldn't be a problem as long as theoutage is brief, such as a reboot required for system updates. Iroutinely reboot our primary head node for yum updates (after verifyingthat the backup head node is running normally).

In any case, the server where the state information is kept should be*very* reliable. We keep ours on the primary head node, which uses ahardware RAID1 for the boot disk and has very strict limits to keep theload to a minimum. Memory use and processes are both limited via/etc/security/limits.d/ and the head node has no access to thecomputational software installed on the cluster, so users aren't temptedto run "quick" jobs on the head node outside the scheduler.

It would be a nice feature if the head node and backup head node couldbe completely independent of each other, but I imagine that keeping themsynchronized would require some challenging coding and the real benefitwould be minimal.


Regards,

    Jason

On 07/25/14 03:33, Bastian Krüger wrote:

Using the same (mounted) slurm installation on all nodes
I recently began working with a cluster that consists of 1 controlnode and several computation node and it was set up a couple of yearsago by someone else. In this current setup, there is only one actualslurm installation, which is located on the control node in/usr/local/slurm. All the other nodes just mount that directory totheir /usr/local/slurm. The only thing that is copied between thenodes is the service startup script in /etc/init.d.
The question is, if that is a good idea or not. I realize that if thecontrol node fails, that all the other nodes lose the mounted slurmdirectory. But how crucial is that?
Also, I'm thinking about adding a backup control node. This node hasto share a directory with the first control node. Are there anyadvises on where this directory should be located? Could it live onthe backup control node or would it be better to use a separate server?

[slurm-dev] Re: Using the same (mounted) slurm installation on all nodes

Reply via email to