Hi Ole, Ole Holm Nielsen <[email protected]> writes:
> You may want to throw in a uniq command in case the user runs multiple jobs on > some nodes: > > # squeue -u user123 -h -o "%N" | tr '\n' , | xargs scontrol show > hostlistsorted > b[135,135,135] > > This gives a better list: > > # squeue -u user123 -h -o "%N" | uniq | tr '\n' , | xargs scontrol show > hostlistsorted > b135 > > BTW, if you enter a non-existent user, the output is an unexpected error > message > and a long help info :-) > > /Ole I have just realised that pdsh, which was what I wanted the consolidated list for, has a Slurm module, which knows about Slurm jobs. I followed your instructions here: https://wiki.fysik.dtu.dk/niflheim/SLURM#pdsh-parallel-distributed-shell with some modifications for EPEL6. However, in the 'rebuild' line rpmbuild --rebuild --with=slurm --without=torque pdsh-2.26-4.el6.src.rpm fails with --with=slurm: unknown option The page https://github.com/grondo/pdsh implies it should be rpmbuild --rebuild --with-slurm --without-torque pdsh-2.26-4.el6.src.rpm but this also fails: --with-slurm: unknown option Any ideas what I'm doing wrong? Cheers, Loris -- Dr. Loris Bennett (Mr.) ZEDAT, Freie Universität Berlin Email [email protected]
