Hi,

On 07.01.2014 at 15:53, Edrisse Chermak wrote:

> I have two 64 CPU nodes, node1 running a 16 CPUs job, and node2 which is
> free:
> 
> HOSTNAME  ARCH     NCPU  LOAD
> node1   linux-x64   64  16.00
> node2   linux-x64   64   0.00
> 
> When I launch a 2nd job asking for 64 CPUs, Grid Engine sometimes sends
> the new job to node1.

This sounds like you are submitting a multi-core job without requesting a 
parallel environment (PE). Without a PE, Grid Engine accounts for the job as 
a single slot, whatever number of cores it really uses. It is best to request 
a PE for every parallel job and specify the 16 or 64 cores there. Then this 
can't happen at all, because node1 lacks the free cores. A plain PE with the 
default values you get when defining a new one is sufficient; it's often 
named "smp" if all slots should stay on one node.

`man sge_pe`

and submit:

$ qsub -pe smp 64 job.sh


(It's necessary to set the proper slot count [i.e. "64"] in the queue 
definition.)
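
Roughly, the setup could look like the sketch below. It is untested and makes 
assumptions: the file name "smp.pe", the queue name "all.q" and the PE-wide 
slot limit of 999 are placeholders I picked, so check the fields against 
`man sge_pe` for your version before using it.

```shell
# Sketch only: "smp.pe", "all.q" and the slot limit 999 are assumed values.
# Write a PE definition file; the field names are those listed in `man sge_pe`.
cat > smp.pe <<'EOF'
pe_name            smp
slots              999
user_lists         NONE
xuser_lists        NONE
start_proc_args    /bin/true
stop_proc_args     /bin/true
allocation_rule    $pe_slots
control_slaves     FALSE
job_is_first_task  TRUE
urgency_slots      min
accounting_summary FALSE
EOF

# Only touch the cluster configuration where Grid Engine is installed.
if command -v qconf >/dev/null 2>&1; then
    qconf -Ap smp.pe                       # add the PE from the file
    qconf -aattr queue pe_list smp all.q   # attach the PE to the queue
    qconf -mattr queue slots 64 all.q      # slot count in the queue definition
fi
```

The allocation rule "$pe_slots" is what keeps all slots of a job on one node, 
so a 64-slot request can only be placed on a node with 64 free slots.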

-- Reuti
