I'm announcing an updated version 0.41 of the node status tool "pestat"
for Slurm. Colored output is now printed also when all nodes are
listed, and the logic has been cleaned up a bit.
Download the tool (a short bash script) from
https://ftp.fysik.dtu.dk/Slurm/pestat. If your commands do not live in
/usr/bin, please make appropriate changes in the CONFIGURE section at
the top of the script.
Usage: pestat [-p partition(s)] [-u username] [-q qoslist] [-s
statelist] [-f] [-V] [-h]
where:
-p partition: Select only partion <partition>
-u username: Print only user <username>
-q qoslist: Print only QOS in the qoslist <qoslist>
-s statelist: Print only nodes with state in <statelist>
-f: Print only nodes that are flagged by * (unexpected load etc.)
-h: Print this help information
-V: Version information
I use "pestat -f" all the time because it prints and flags (in color)
only the nodes which have an unexpected CPU load or node status, for
example:
# pestat -f
Select only user user01
Hostname Partition Node Num_CPU CPUload Memsize Joblist
State Use/Tot (MB) JobId User ...
g045 xeon16 alloc 16 16 11.81* 64000 84943 user01
g047 xeon16 alloc 16 16 11.79* 64000 84943 user01
g068 xeon16 alloc 16 16 15.11* 64000 84943 user01
The -s option is useful for checking on possibly unusual node states,
for example:
# pestat -s mixed
--
Ole Holm Nielsen
Department of Physics, Technical University of Denmark