https://bugzilla.wikimedia.org/show_bug.cgi?id=67347

--- Comment #3 from Merlijn van Deen <[email protected]> ---
and, from man qstat:

       If the state is a(larm) at least on of the load thresholds defined in
the load_thresholds list of the queue configuration (see queue_conf(5)) is
currently exceeded, which prevents from scheduling further jobs to that queue.

       As  opposed to this, the state A(larm) indicates that at least one of
the suspend thresholds of the queue (see queue_conf(5)) is currently exceeded.
This will result in jobs running in that queue being successively suspended
       until no threshold is violated.

       The states s(uspended) and d(isabled) can be assigned to queues and
released via the qmod(1) command. Suspending a queue will cause all jobs
executing in that queue to be suspended.

(...)

       If  an E(rror) state is displayed for a queue, sge_execd(8) on that host
was unable to locate the sge_shepherd(8) executable on that host in order to
start a job. Please check the error logfile of that sge_execd(8) for leads
       on how to resolve the problem. Please enable the queue afterwards via
the -c option of the qmod(1) command manually.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to