Yes. When you require your users to specify h_rt or some other resource,
it's good practice to make '-w e' a default switch. This causes the job
to fail immediately if they forget to specify the resource, alerting the
submitter that there's something wrong with their job specification.
Otherwise, the job is accepted, but sits in the queue forever... or it
might fail when SGE finally attempts to run it. I forget exactly what
happens, and there are too many jobs queued up in my cluster to test it
right now.
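
A minimal sketch of a complementary client-side check, assuming job
scripts carry their resource requests in embedded '#$ -l' lines, as in
the mpihello.sh example quoted below. The helper name has_h_rt is
hypothetical and not part of SGE, and it does not replace '-w e'; it
only looks at the script itself, so an h_rt given on the qsub command
line would not be seen.

```shell
# Hypothetical helper (not part of SGE): succeed if the job script
# requests h_rt in one of its embedded "#$ -l ..." directive lines.
has_h_rt() {
    # Match lines such as:  #$ -l "h_rt=00:05:00,exclusive=true"
    grep -Eq '^#\$[[:space:]]+-l[[:space:]].*h_rt=' "$1"
}

# Example use before submitting (the message text is illustrative):
#   has_h_rt mpihello.sh || echo "mpihello.sh: no h_rt request found" >&2
```

A check like this only gives the submitter an earlier, friendlier
message; the scheduler-side validation is still what decides whether
the job is accepted.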

--
Prentice


On 02/01/2012 01:21 PM, Rayson Ho wrote:
> I think I know how to fix this, but what I wanted to find out is - is
> it common to check ("-w e") before submitting jobs to Grid Engine??
>
> Rayson
>
>
>
> On Fri, Jan 27, 2012 at 4:50 PM, Reuti <[email protected]> wrote:
>> Hi,
>>
>> On 27.01.2012 at 22:23, Prentice Bisbal wrote:
>>
>>> GE users,
>>>
>>> A while back, I made it mandatory for my users to specify h_rt when they
>>> submit jobs. To prevent jobs from being queued but never running, I added
>>>
>>> -w e
>>>
>>> to my $SGE_ROOT/default/common/sge_request
>>>
>>> An unintended side effect is that now certain jobs get rejected if they
>>> can't be run immediately with this error:
>> yep, I faced the same:
>>
>> http://gridengine.org/pipermail/users/2011-March/000512.html
>>
>> I don't know whether it was fixed in any fork.
>>
>> -- Reuti
>>
>>
>>> Unable to run job: error: no suitable queues.
>>> Exiting.
>>>
>>> If I submit the same job, but specify '-w w' or '-w n', it will be
>>> accepted and queued up.
>>>
>>> Reading the qsub man page, this should not be. -w validates a job
>>> assuming an empty system with no other jobs, so if a job won't run with
>>> '-w e', it certainly shouldn't run without it, either!
>>>
>>> Here's an example of my problem using a simple MPI "Hello, World" Program:
>>>
>>> # My submit script
>>>
>>> $ more mpihello.sh
>>> #!/bin/bash
>>> #$ -N mpihello
>>> #$ -pe orte 1
>>> #$ -cwd
>>> #$ -V
>>> #$ -R y
>>> #$ -l "h_rt=00:05:00,exclusive=true,cuda=false"
>>>
>>> MPI=/usr/local/openmpi/pgi/x86_64
>>> PATH=${MPI}/bin:${PATH}
>>> LD_LIBRARY_PATH=${MPI}/lib
>>> mpirun ./mpihello
>>>
>>> # My 'normal' qsub command
>>>
>>> $ qsub mpihello.sh
>>> Unable to run job: error: no suitable queues.
>>> Exiting.
>>>
>>> # Using qsub with '-w n'
>>>
>>> $ qsub -w n mpihello.sh
>>> Your job 1247254 ("mpihello") has been submitted
>>>
>>> In this case, my job is failing because of the exclusive=true in the
>>> submit script. All of my cluster nodes are busy at the moment, so I
>>> can't get exclusive use of any node right now. I would expect the job to
>>> be queued until a node becomes free, but instead it's being rejected.
>>> I'm using SGE 6.2u3. Is this a bug?
>>>
>>> --
>>> Prentice
>>>
>>> _______________________________________________
>>> users mailing list
>>> [email protected]
>>> https://gridengine.org/mailman/listinfo/users