On 06/22/2017 06:22 PM, Kilian Cavalotti wrote:
> ClusterShell is incredibly useful, it provides not only a parallel
> shell for remote execution (and file distribution, output aggregation
> or diff'ing...), but also an event-driven Python library that can be
> used in your Python scripts, and CLI tools to manipulate node sets
> (any kind of logical operation between node groups, expansion,
> multi-dimensional folding, counting, stepping, you name it). Oh, and
> the tree mode [3]? You have to try it.
>
> I can only encourage you to give a look at the documentation [2],
> there are too many awesome features to describe here. ;)
>
> [1] https://cea-hpc.github.io/clustershell/
> [2] https://clustershell.readthedocs.io/en/latest/intro.html
> [3] https://clustershell.readthedocs.io/en/latest/tools/clush.html#tree-mode

Yes, ClusterShell indeed has lots of features and compares favorably to
PDSH. I've added a brief description to my Slurm Wiki at
https://wiki.fysik.dtu.dk/niflheim/SLURM#clustershell; please comment on
it off-line if you have the time.

However, after a brief reading of the ClusterShell manual, it still
isn't clear to me how to use it with Slurm partitions. The basic
functionality works:

# clush -w i[001-003] date
i001: Fri Jun 23 09:52:29 CEST 2017
i003: Fri Jun 23 09:52:29 CEST 2017
i002: Fri Jun 23 09:52:29 CEST 2017

But how do I configure it for Slurm? I've copied the example file to
/etc/clustershell/groups.conf.d/slurm.conf, but this doesn't enable
Slurm partitions (here: xeon24) as ClusterShell groups:

# clush -g xeon24 date
Usage: clush [options] command
clush: error: No node to run on.
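
For context, a sketch of how such a group source is usually wired up, assuming the copied example file defines a source named slurmpart (alias sp) on top of sinfo. The section name, the alias, and the exact sinfo format strings are assumptions here, so they should be checked against the file actually installed:

```ini
# /etc/clustershell/groups.conf.d/slurm.conf
# (assumed content, modeled on the shipped example; verify locally)
[slurmpart,sp]
# nodes belonging to one partition ($GROUP is the partition name)
map: sinfo -h -o "%N" -p $GROUP
# all nodes known to Slurm
all: sinfo -h -o "%N"
# list of partitions, exposed as group names
list: sinfo -h -o "%R"
```

If the source is defined this way but is not the default one, the group presumably has to be qualified with the source name, e.g. "clush -w @slurmpart:xeon24 date", unless the "default:" entry in the [Main] section of /etc/clustershell/groups.conf is pointed at slurmpart.
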

Could you kindly explain this (and perhaps add examples to the
documentation)?

Thanks,
Ole