On Fri, 18 Mar 2011, Reuti wrote:
...
>> Am I wildly off here? What do others set usage_weight_list to?
> We use a fair-share functional policy based only on slots, hence the
> default works. Most of the time the slot count is the limiting
> factor, not the memory (exceptions apply).
Ah, you're lucky :)
Unfortunately, we have some large memory users who are dominating the
cluster with the default policy - so I have to do something :(
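To make the problem concrete, here is a small Python sketch of how the
default weighting behaves on one of our 12-slot, 24G nodes (the function
name is made up; the point is only that memory never enters the charge):

    # Default weighting: usage is driven by slot-seconds only, so
    # memory requests are invisible to the fair-share calculation.
    def usage_slots_only(slots, mem_gb):
        return slots  # mem_gb is ignored entirely

    print(usage_slots_only(1, 1))    # ordinary 1-slot job     -> 1
    print(usage_slots_only(1, 24))   # 1 slot + all 24G in box -> 1 (!)
    # Both are charged identically, yet the second job leaves no memory
    # for anyone else and effectively blocks all 12 slots.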
...
> Whether the user requests 1G, 100M or 2G, the job is blocking a
> complete node with its 12 slots (why should it be more expensive when
> you use even more memory than 1G?) and should be charged in the same
> way as a single job requesting 24G. Maybe a reverse approach would do:
> check what's left for other jobs, and reduce the charge from the
> maximum value accordingly. If 23G but 0 slots are left, you have to
> pay the full price, and likewise if 0G but 11 slots are left.
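If I read that right, it amounts to charging for whichever resource the
job denies to others. A minimal Python sketch, assuming the same
12-slot/24G node (the function and parameter names are mine):

    def usage_whats_left(slots, mem_gb, node_slots=12, node_mem_gb=24):
        # Full price whenever either slots or memory is exhausted;
        # otherwise charge the dominant fraction of the node consumed.
        return node_slots * max(slots / node_slots, mem_gb / node_mem_gb)

    print(usage_whats_left(12, 1))   # 23G left, 0 slots left -> 12.0
    print(usage_whats_left(1, 24))   # 0G left, 11 slots left -> 12.0
    print(usage_whats_left(6, 12))   # half a node either way ->  6.0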
Unfortunately, I'm stuck with what the scheduler can do today. I agree
that the current configuration options do not do exactly what I need, but
to be honest I'm not sure what improvements we can make that also maintain
Grid Engine's flexibility (e.g. multiple queues per host, a slot does not
necessarily mean a CPU, etc.). In fact, I'm even unsure how to make my own
model of usage more accurate without having to run a simulation of the
cluster, on the cluster :)
Perhaps we could extend the config so that the administrator can define
the usage calculation? This might be a good start - being able to use
something other than a straight line would be useful - while not shackling
us to specific assumptions. This would allow Grid Engine to adopt an
organisation's definition of "fair", rather than the other way around...
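Purely hypothetical, since no such hook exists in the scheduler today, but
the kind of thing I mean looks like this in Python (the names and the
particular curve are invented for illustration):

    DEFAULT_MEM_GB = 1.0   # our per-slot default memory allotment

    def site_usage(slots, mem_gb_per_slot):
        # A site-defined curve instead of a fixed straight line:
        # memory above the default allotment is charged progressively.
        mem_units = max(mem_gb_per_slot, DEFAULT_MEM_GB) / DEFAULT_MEM_GB
        return slots * (0.51 + 0.49 * mem_units ** 1.5)

    print(site_usage(6, 2.0))   # half a node, charged on the curve

The scheduler would evaluate the site's function per job, so "fair" is
whatever the organisation says it is.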
To be honest though, I still need to see how influential this option would
be on scheduling decisions in practice: with many workloads, there may be
enough "noise" in the system to get away with the present options.
>> * 6 slots at 2G/slot (half a node) = 6*(0.51 + 0.49*2) = 8.94 usage per sec
>>   (1G is our default memory allotment)
>> I'm not wildly happy that the half-node case is being over-charged
>> (going to have to think about that),
> Yep, this is the point I mentioned earlier: you can run two of them on
> one node.
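Spelling the over-charge out with those weights (the same formula as in
my example above, just in Python):

    def usage_weighted(slots, mem_gb_per_slot):
        # 0.51/0.49 weights, memory in units of the 1G default allotment
        return slots * (0.51 + 0.49 * mem_gb_per_slot)

    print(usage_weighted(6, 2))        # half node: 8.94
    print(2 * usage_weighted(6, 2))    # two half-node jobs: 17.88
    print(usage_weighted(12, 1))       # whole node at the default: 12.0
    # Two half-node jobs fill exactly one node but are charged about
    # 49% more (17.88 vs 12.00) than one job taking the whole node.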
Throwing a few numbers around, I have come to the conclusion that this is
acceptable in our environment.
With the default policy, the example 12-slot host and the worst-case job
(1 slot + all the memory in the box), the usage calculation is a factor of
12 out. Bringing memory into the policy, the usage calculation is at worst
a factor of 2 out.
As long as enough large-memory jobs are being submitted, I think this can
be a reasonable trade-off.
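For the record, my arithmetic behind those two factors (same 12-slot/24G
node and the weighted formula from above):

    def usage_weighted(slots, mem_gb_per_slot):
        return slots * (0.51 + 0.49 * mem_gb_per_slot)

    # Worst-case job: 1 slot + 24G blocks the whole node (true cost 12).
    print(12 / 1)                         # default charges 1 slot -> 12x out
    print(usage_weighted(1, 24))          # weighted: 12.27 -> nearly exact
    print(2 * usage_weighted(6, 2) / 12)  # worst over-charge: 1.49 -> under 2x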
Mark
--
-----------------------------------------------------------------
Mark Dixon Email : [email protected]
HPC/Grid Systems Support Tel (int): 35429
Information Systems Services Tel (ext): +44(0)113 343 5429
University of Leeds, LS2 9JT, UK
-----------------------------------------------------------------