Greetings --

A few users have experienced jobs pending due to the reason
ReqNodeNotAvail(Unavailable:<nodename1>,<nodename2>,...)
We have determined that the jobs in fact were pending due to asking for
TimeLimit > "time remaining before maintenace shutdown" -- managed by
making a global reservation on all nodes, all partitions.

This reason code is not very helpful in understanding the reason for the
jobs being pending. Resubmitting the jobs with appropriate TimeLimit allows
the jobs to run immediately. The jobs were therefore pending due to
"excessive time requested".

To prehaps help knowledgeable developers understand why the Reason
ReqNodeNotAvail appeared, I note that the nodes listed in the above are
actually being drained in advance of updates. Happy to provide further
information as needed.

Best wishes,
~ Emily

----------------------------------
E.M. Dragowsky, Ph.D.
ITS -- Research Computing
Case Western Reserve University
(216) 368-0082

Reply via email to