Yeah, we've been running CentOS 6 and slurm in this fashion for about a
year and a half on about a thousand machines and haven't really had a
problem with this. Though I don't know if this method scales
indefinitely. We just have a symlink back to our conf from
/etc/slurm/slurm.conf. We then control the version via RPM installs.
-Paul Edmon-
On 3/24/2015 4:22 PM, Jason Bacon wrote:
I ran one of our CentOS clusters this way for about a year and found
it to be more trouble than it was worth.
I recently reconfigured it to run all system services from local disks
so that nodes are as independent of each other as possible. Assuming
you have ssh keys on all the nodes, syncing slurm.conf and other files
is a snap using a simple shell script. We only use NFS for data files
and user applications at this point.
Of course, if your compute nodes don't have local disks, that's
another story.
Jason
On 03/24/15 14:42, Jeff Layton wrote:
Good afternoon,
I apologies for the newb question but I'm setting up slurm
for the first time in a very long time. I've got a small cluster
of a master node and 4 compute nodes. I'd like to install
slurm on an NFS file system that is exported from the master
node and mounted on the compute nodes. I've been reading
a bit about this but does anyone have recommendations on
what to watch out for?
Thanks!
Jeff