Hi Kilian,
Thanks for explaining how to configure ClusterShell correctly for Slurm!
I've updated my Wiki information in
https://wiki.fysik.dtu.dk/niflheim/SLURM#clustershell now.
I would suggest you to add your examples to the ClusterShell
documentation, where I feel it may be hidden or missing.
/Ole
On 06/23/2017 06:37 PM, Kilian Cavalotti wrote:
But how do I configure fro Slurm?? I've copied the example file to
/etc/clustershell/groups.conf.d/slurm.conf, but this doesn't enable Slurm
partitions (here: xeon24) as ClusterShell groups:
# clush -g xeon24 date
Usage: clush [options] command
clush: error: No node to run on.
Could you kindly explain this (and perhaps add examples to the
documentation)?
>
Cheers,
--
Ole Holm Nielsen
PhD, Manager of IT services
Department of Physics, Technical University of Denmark,
Building 307, DK-2800 Kongens Lyngby, Denmark
E-mail: [email protected]
Homepage: http://dcwww.fysik.dtu.dk/~ohnielse/
Tel: (+45) 4525 3187 / Mobile (+45) 5180 1620
Sure! That's because the groups.conf.d/slurm.conf file defines new
group sources [1]. ClusterShell supports multiple group sources, ie.
multiple sources of information to define groups. There is a default
one, defined in groups.conf, which will be used when a group name is
used, without specifying anything else, as in your "clush -g xeon24
date" command. But since the "slurm" group source is not the default,
it's not used to map the "xeon24" group to the corresponding Slurm
partition.
So, you can either:
* use the -s option to specify a group source, or prefix the group
name with the group source name in the command line, like this:
$ clush -s slurm -g xeon24 date
or, more compact:
$ clush -w@slurm:xeon24 date
* or if you don't plan to use any other group source than "slurm", you
can make it the default with the following in
/etc/clustershell/groups.conf:
[Main]
# Default group source
default: slurm
With the example Slurm group source, you can easily execute commands
on all the nodes from a given partition, but also on nodes based on
their Slurm state, like:
$ clush -w@slurmstate:drained date
Hope this makes things a bit clearer.
[1]
https://clustershell.readthedocs.io/en/latest/config.html#external-group-sources