On 28.03.2014 at 21:31, Karun K wrote:
> No other requests, just slots
>
> echo "sleep 10" | qsub -pe threaded 19
Yes, this job. Was this also the request for all jobs that are running right now?
Is the slots value set in the PE definition large enough to cover all nodes?
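
As a rough sketch of how to check this, assuming the PE is named "threaded" as
in your qsub line: "qconf -sp threaded" prints the PE definition, and its
"slots" line is the total the PE may hand out across the whole cluster. The
helper name pe_slot_limit and the sample excerpt below are made up for
illustration only; on a real host you would pipe the qconf output in instead.

```shell
#!/bin/sh
# Extract the value of the "slots" line from a PE definition read on stdin.
# pe_slot_limit is a hypothetical helper name, not part of Grid Engine.
pe_slot_limit() {
    awk '$1 == "slots" { print $2 }'
}

# On a Grid Engine host:
#   qconf -sp threaded | pe_slot_limit
# Here a hypothetical excerpt (the value 999 is invented) stands in for it:
printf 'pe_name threaded\nslots 999\nallocation_rule $pe_slots\n' | pe_slot_limit
```

The printed number should be at least the sum of the slots on all nodes the PE
is supposed to span.
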
> default_duration is set to INFINITY currently.
It's best to put a value here, such as a couple of days, that represents the
usual runtime of the jobs. Otherwise INFINITY might allow jobs with a
reservation to slip in whenever no "-l h_rt=..." is requested to set the actual
expected runtime (and they might reserve slots beforehand too).
Can you submit the jobs with a reservation ("-R y") and switch on
reservations by setting "max_reservation" in the scheduler configuration?
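
A minimal dry-run sketch of such a submission follows. The runtime of 10
minutes is an assumed value for your test job, not something taken from your
setup, and reservation_opts is only an illustrative helper. The script prints
the qsub line instead of running it; the scheduler settings themselves can be
inspected with "qconf -ssconf" and edited with "qconf -msconf".

```shell
#!/bin/sh
# Compose the qsub options for a PE job that asks for a reservation.
# $1 = number of slots in the "threaded" PE
# $2 = expected runtime for h_rt (assumed value, pick one matching your jobs)
# reservation_opts is a hypothetical helper name, not a Grid Engine command.
reservation_opts() {
    printf '%s' "-pe threaded $1 -R y -l h_rt=$2"
}

# Dry run: print the full command line rather than submitting anything.
echo "echo \"sleep 10\" | qsub $(reservation_opts 19 0:10:0)"

# To check the reservation-related scheduler settings (needs Grid Engine):
#   qconf -ssconf | egrep 'max_reservation|default_duration'
```

Note that "max_reservation" has to be greater than 0, otherwise "-R y" has no
effect.
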
-- Reuti
> Thanks!
>
>
> On Thu, Mar 27, 2014 at 5:33 PM, Reuti <[email protected]> wrote:
> On 28.03.2014 at 00:28, Karun K wrote:
>
> > We have a PE "threaded", and each node has 30 slots and 120 GB RAM.
> > Jobs requesting 19 or more PE slots get stuck in the queue in qw state with
> > the following error:
> >
> > parallel environment: threaded range: 19
> > scheduling info: cannot run in PE "threaded" because it only
> > offers 0 slots
>
> This output is (often) misleading.
>
> No job requests any "-l h_rt=..." and "-R y" for a reservation? What is the
> value of "default_duration" in the scheduler configuration?
>
> -- Reuti
>
>
> > which doesn't make any sense. Currently there are more than 30 nodes that
> > are idle with 30 slots each.
> >
> > I am running a simple test job; no other complexes are requested:
> > echo "sleep 10" | qsub -pe threaded 19
> >
> > We are using GE 2011.11p1
> >
> > Here is the configuration of one of the execution hosts from the SGE config:
> >
> > hostname compute-2-2.local
> > load_scaling NONE
> > complex_values slots=30,h_vmem=120G,io_slots=30
> > load_values arch=linux-x64,num_proc=30,mem_total=123136.023438M, \
> > swap_total=3999.992188M,virtual_total=127136.015625M, \
> > load_avg=11.020000,load_short=11.000000, \
> > load_medium=11.020000,load_long=10.810000, \
> > mem_free=75806.339844M,swap_free=3973.246094M, \
> > virtual_free=79779.585938M,mem_used=47329.683594M, \
> > swap_used=26.746094M,virtual_used=47356.429688M, \
> > cpu=36.200000,m_socket=30,m_core=30,np_load_avg=0.367333, \
> > np_load_short=0.366667,np_load_medium=0.367333, \
> > np_load_long=0.360333
> > processors 30
> > user_lists NONE
> > xuser_lists NONE
> > projects NONE
> > xprojects NONE
> > usage_scaling NONE
> > report_variables NONE
> >
> > Thanks,
> > _______________________________________________
> > users mailing list
> > [email protected]
> > https://gridengine.org/mailman/listinfo/users
>
>