Hi John,

You might want use a custom formatting for the squeue using
*SQUEUE_FORMAT *environment
variable or the command line.

Ref: http://slurm.schedmd.com/squeue.html

<snip>
*-o <output_format>*, *--format=<output_format>* Specify the information to
be displayed, its size and position (right or left justified). Also see the *-O
<output_format>*, *--Format=<output_format>* option described below (which
supports less flexibility in formatting, but supports access to all
fields). The default formats with various options are


*default* "%.18i %.9P %.8j %.8u %.2t %.10M %.6D %R" *-l, --long* "%.18i
%.9P %.8j %.8u %.8T %.10M %.9l %.6D %R" *-s, --steps* "%.15i %.8j %.9P %.8u
%.9M %N"


The format of each field is "%[[.]size]type".
*size* is the minimum field size. If no size is specified, whatever is
needed to print the information will be used. * .* indicates the output
should be right justified and size must be specified. By default, output is
left justified.</snip>

Best regards,

Roshan

On 26 May 2016 at 13:34, John Hearns <[email protected]> wrote:

> Just an update - looks like I have solved this.
>
> I would still like to be able to see the full NODELIST(REASON) though!
>
> Looking iun the slurmd logs, I saw that the nodes were reporting a
> different slurm.conf from the master
> The solution was actuall to restart slurmd on the master in this case.
>
>
> -----Original Message-----
> From: John Hearns [mailto:[email protected]]
> Sent: 26 May 2016 09:21
> To: slurm-dev <[email protected]>
> Subject: [slurm-dev] ReqNodeNotAvail - can't see all info
>
>
> I am scheduling an HPCC job on a certain set of nodes using -nodelist
>
> I am getting informed that a node is not available - but for the life of
> me I cannot expand the NODELIST(REASON) fied to show it.
>
>
>         JOBID PARTITION     NAME     USER ST       TIME  NODES
> NODELIST(REASON)
>                340      defq run-hpcc    johnh PD       0:00      4
> (Resources)
>                344      defq run-hpcc    johnh PD       0:00      4
> (ReqNodeNotAvail(Unavailable:co
>                346      defq run-hpcc    johnh PD       0:00      1
> (ReqNodeNotAvail(Unavailable:co
>                350      defq run-hpcc    johnh PD       0:00      1
> (ReqNodeNotAvail(Unavailable:co
>                348      defq run-hpcc    johnh  R       1:17      2
> comp[15-16]
>
>
> Also this may be relevant - I have the known problem of a job not
> teminating properly.
> In slurmdbd
>
> [2016-05-26T09:00:00.838] error: We have more allocated time than is
> possible (172800 > 126000) for cluster slurm_cluster(35) from
> 2016-05-26T08:00:00 - 2016-05-26T09:00:00
>
> I ruan the lost.pl  script from the bugzilla and it finds not
> still-running jobs.
>
> slurm version 14.11.6
> Any views or opinions presented in this email are solely those of the
> author and do not necessarily represent those of the company. Employees of
> XMA Ltd are expressly required not to make defamatory statements and not to
> infringe or authorise any infringement of copyright or any other legal
> right by email communications. Any such communication is contrary to
> company policy and outside the scope of the employment of the individual
> concerned. The company will not accept any liability in respect of such
> communication, and the employee responsible will be personally liable for
> any damages or other liability arising. XMA Limited is registered in
> England and Wales (registered no. 2051703). Registered Office: Wilford
> Industrial Estate, Ruddington Lane, Wilford, Nottingham, NG11 7EP
> Any views or opinions presented in this email are solely those of the
> author and do not necessarily represent those of the company. Employees of
> XMA Ltd are expressly required not to make defamatory statements and not to
> infringe or authorise any infringement of copyright or any other legal
> right by email communications. Any such communication is contrary to
> company policy and outside the scope of the employment of the individual
> concerned. The company will not accept any liability in respect of such
> communication, and the employee responsible will be personally liable for
> any damages or other liability arising. XMA Limited is registered in
> England and Wales (registered no. 2051703). Registered Office: Wilford
> Industrial Estate, Ruddington Lane, Wilford, Nottingham, NG11 7EP
>

Reply via email to