On 4 December 2015 at 21:07, Feng Zhang <[email protected]> wrote:
> Did you use "-l gpu=1" in your job script?
>
> --
> Best,
>
> Feng

I am not having very much luck with this. The job is getting stuck
#qalter -w p 412
Job 412 cannot run in queue "all.q" because it is not contained in its
hard queue list (-q)
Job 412 cannot run in PE "mpi" because it only offers 2 slots
verification: no suitable queues

The job script looks like this

#!/bin/csh
#$ -V
#$ -S /bin/csh
#$ -N j1
#$ -q "gpu.q"
#$ -l gpu=1
#$ -m beas
#$ -j y -o /home/rajil/tmp/tst/j1.qlog
#$ -pe mpi 8
abaqus python /share/apps/abaqus/6.14-2/../abaJobHandler.py j1
/home/rajil/tmp/tst j1.fs.131566 0 j1.com model.inp


PE mpi is defined as

#qconf -sp mpi
pe_name mpi
slots 9999
user_lists NONE
xuser_lists NONE
start_proc_args /opt/gridengine/mpi/startmpi.sh $pe_hostfile
stop_proc_args /opt/gridengine/mpi/stopmpi.sh
allocation_rule $fill_up
control_slaves FALSE
job_is_first_task TRUE
urgency_slots min
accounting_summary TRUE


Each node has 32 cpus and 1 gpu. Why is pe_slots being limited to 2
even when 64 cpus are available?

-Rajil
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to