Hello,

I am having a simple problem: mem_free resource requests are being
treated differently on two different queues (actually, on two separate
installations of SGE).

For context, the hosts servicing queue.A each have 32G of memory and 4
cores, while those servicing queue.B each have 24G and 4 cores. All
queues/nodes are idle/empty when I perform the following tests.

On queue.A, the following jobs get scheduled and run as expected:

$ echo ls | qsub -q queue.A -pe smp 1 -l mem_free=20G #runs right away
$ echo ls | qsub -q queue.A -pe smp 2 -l mem_free=20G #also runs right away

On queue.B, only the first gets run:

$ echo ls | qsub -q queue.B -pe smp 1 -l mem_free=20G #runs right away
$ echo ls | qsub -q queue.B -pe smp 2 -l mem_free=20G #sits in 'qw' state forever

Conjecture: mem_free is being treated as a "per-core" request for queue.B
but not for queue.A.
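
To spell out the arithmetic behind this conjecture, here is a minimal sketch. The helper name fits_per_slot is hypothetical (it is not an SGE command), and it assumes that "per-core" means the mem_free request is multiplied by the number of slots asked for with -pe and then compared against a single host's memory.

```shell
# Hypothetical helper, not part of SGE: does a request fit on one host
# if mem_free is multiplied by the number of slots (per-slot accounting)?
# Arguments: host memory in GB, requested mem_free in GB, slot count.
fits_per_slot() {
    host_gb=$1
    request_gb=$2
    slots=$3
    # Assumed per-slot rule: total demand = request * slots
    total_gb=$((request_gb * slots))
    if [ "$total_gb" -le "$host_gb" ]; then
        echo "fits ($total_gb of $host_gb GB)"
    else
        echo "does not fit ($total_gb of $host_gb GB)"
    fi
}

fits_per_slot 24 20 1   # queue.B host, -pe smp 1
fits_per_slot 24 20 2   # queue.B host, -pe smp 2
fits_per_slot 32 20 2   # queue.A host, -pe smp 2
```

Under that assumption the two-slot job needs 40G, which exceeds the 24G on a queue.B host and matches the job that never leaves 'qw'. It would also exceed the 32G on a queue.A host, so the fact that the same job runs on queue.A suggests the request is not being multiplied there. One place the difference might show up is the consumable column of the mem_free entry in the output of qconf -sc on each installation; that is a guess, not something verified here.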

Question 1: is this possible?
Question 2: how do I fix queue.B so that it performs like queue.A?

Apologies if my terminology is not 100% correct; I deal with SGE
administration infrequently.

Thanks in advance for any suggestions,

Joe
