I'm announcing version 0.41 of "pestat", the node status tool for Slurm. Colored output is now also printed when all nodes are listed, and the logic has been cleaned up a bit.

Download the tool (a short bash script) from https://ftp.fysik.dtu.dk/Slurm/pestat. If your commands do not live in /usr/bin, please make appropriate changes in the CONFIGURE section at the top of the script.
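
A quick way to decide whether the CONFIGURE section needs editing is to check where the Slurm commands are installed on your system. The following is only a sketch: the helper name cmd_dir is hypothetical, and the list of commands checked (sinfo, squeue, scontrol) is an assumption about what the script calls, so compare it with the CONFIGURE section itself.

```shell
# Hypothetical helper: print the directory in which a command is found
# on the PATH, or return non-zero if the command is not found.
cmd_dir() {
    cmd_path=$(command -v "$1") || return 1
    dirname "$cmd_path"
}

# Assumed list of Slurm commands; adjust it to match the CONFIGURE section.
for c in sinfo squeue scontrol; do
    if d=$(cmd_dir "$c"); then
        echo "$c: $d"
    else
        echo "$c: not found on PATH"
    fi
done
```

If any directory printed is something other than /usr/bin, that is the path to put into the CONFIGURE section.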

Usage: pestat [-p partition(s)] [-u username] [-q qoslist] [-s statelist] [-f] [-V] [-h]
where:
        -p partition: Select only partition <partition>
        -u username: Print only user <username>
        -q qoslist: Print only QOS in the qoslist <qoslist>
        -s statelist: Print only nodes with state in <statelist>
        -f: Print only nodes that are flagged by * (unexpected load etc.)
        -h: Print this help information
        -V: Version information

I use "pestat -f" all the time because it prints and flags (in color) only the nodes which have an unexpected CPU load or node status, for example:

# pestat -f -u user01
Select only user user01
Hostname       Partition     Node Num_CPU CPUload  Memsize Joblist
                            State Use/Tot             (MB) JobId User ...
    g045          xeon16    alloc  16  16   11.81*    64000 84943 user01
    g047          xeon16    alloc  16  16   11.79*    64000 84943 user01
    g068          xeon16    alloc  16  16   15.11*    64000 84943 user01
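
Since flagged values carry a trailing *, the output is easy to post-process. As a minimal sketch (flagged_hosts is a hypothetical helper, not part of pestat, and it assumes the column layout shown above), this filter prints just the hostnames of flagged nodes:

```shell
# Hypothetical helper: read pestat output on stdin and print the hostname
# (first column) of every line in which some later field ends with "*".
# The header lines contain no such field, so they are skipped.
flagged_hosts() {
    awk '{ for (i = 2; i <= NF; i++) if ($i ~ /\*$/) { print $1; break } }'
}

# Example usage (needs pestat and a running Slurm cluster):
#   pestat -f | flagged_hosts
```

The resulting list can be passed to other tools, for example to log in to each flagged node and inspect the running processes.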

The -s option is useful for checking on possibly unusual node states, for example:

# pestat -s mixed

--
Ole Holm Nielsen
Department of Physics, Technical University of Denmark
