Good day Alex,

you could try implementing an "exclusive" resource and requesting it with
"-soft", e.g. "-soft -l exclusive" in the sge_request file as a default.
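
A rough, untested sketch of what that might look like. It assumes a Grid Engine release that supports the EXCL relation operator for complexes; the resource name "exclusive" and shortcut "excl" are just illustrative choices, and <hostname> is a placeholder. Whether a soft request for an exclusive resource really makes the scheduler fill empty nodes first is exactly the part I have not verified.

```
# Complex definition, added with "qconf -mc" (columns as in "man complex"):
#name       shortcut  type  relop  requestable  consumable  default  urgency
exclusive   excl      BOOL  EXCL   YES          YES         0        1000

# On each execution host, edited with "qconf -me <hostname>":
complex_values   exclusive=true

# Cluster-wide default request, in $SGE_ROOT/$SGE_CELL/common/sge_request:
-soft -l exclusive=true
```

Because the request is soft, a job should still be dispatched when no completely free host is left; it is only meant to express a preference for hosts where the exclusive resource can still be granted.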

I have never tried this combination, but have a look at "man complex".
It's just an idea.

Erik Soyez


On Wed, 16 Mar 2011, Alex Phillips wrote:

Dear List,
We have a cluster of 1920 cores spread over 160 nodes (12 cores/node). We run only one code in one queue, with jobs of between 48 and 256 cores using an mpi PE. When benchmarking our code, we found a 14-15% speedup when running on 6 cores/node compared with 12 cores/node. We also found that if we ran on 6 cores/node with a second job on the other 6 cores/node, we still had a 5-6% speedup.

So I have configured our mpi PE with allocation_rule = 6, and this works. However, as the cluster fills up, the scheduler starts a second job on some nodes before all the nodes are busy. How can we configure the scheduler to run one job on every node before starting a second job on any of them?

I have tried defining the number of slots as a complex value on the execution hosts, and I've tried -np_load_avg, np_load_avg, slots, and -slots as the load_formula, but I can't get it to work. I've read http://blogs.sun.com/sgrell/entry/grid_engine_scheduler_hacks_least but I can't set the allocation rule to $pe_slots, as we only want to run on 6 cores/node, not 12.

Any suggestions?


