In the message dated: Tue, 01 Jan 2013 17:38:23 +0100,
The pithy ruminations from Reuti on 
<Re: [gridengine users] How prevent abnormal nodes load using qsub?> were:
=> Am 31.12.2012 um 09:34 schrieb Semi:
=> 
=> > The memory is not a problem, the problem CPU load,
=> > every python process runs 2 other processes and this stuck nodes.
=> > For examle: 16 CPU nodes run 48 python processes.
=> 

        [SNIP!]

=> 
=> If all load is coming from his Python script, it needs to be adjusted to run 
only serial and not
=>  taking all available cores in the machine.

Assuming that there's a PE for running multi-threaded jobs already
configured in the cluster, the user could submit the job to that PE (named
"threaded" in the example below).  If each python process has 2 children,
then the job submission might look like:

     qsub -pe threaded 3 myjob.py

to tell SGE that each job uses 3 slots, so that SGE won't oversubscribe
a node.

Mark

=> 
=> -- Reuti
=> 
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to