>>> On Thu, Feb 21, 2008 at 11:41 AM, in message <[EMAIL PROTECTED]>,
Andi Kleen <[EMAIL PROTECTED]> wrote: 

>> +config RTLOCK_DELAY
>> +    int "Default delay (in loops) for adaptive rtlocks"
>> +    range 0 1000000000
>> +    depends on ADAPTIVE_RTLOCK
> 
> I must say I'm not a big fan of putting such subtle configurable numbers
> into Kconfig. Compilation is usually the wrong place to configure
> such a thing. Just having it as a sysctl only should be good enough.
> 
>> +    default "10000"
> 
> Perhaps you can expand how you came up with that default number? 

Actually, the number doesn't seem to matter that much as long as it is 
sufficiently long enough to make timeouts rare.  Most workloads will present 
some threshold for hold-time.  You generally get the best performance if the 
value is at least as "long" as that threshold.  Anything beyond that and there 
is no gain, but there doesn't appear to be a penalty either.  So we picked 
10000 because we found it to fit that criteria quite well for our range of GHz 
class x86 machines.  YMMY, but that is why its configurable ;)

> It looks suspiciously round and worse the actual spin time depends a lot on 
> the 
> CPU frequency (so e.g. a 3Ghz CPU will likely behave quite 
> differently from a 2Ghz CPU) 

Yeah, fully agree.  We really wanted to use a time-value here but ran into 
various problems that have yet to be resolved.  We have it on the todo list to 
express this in terms in ns so it at least will scale with the architecture.

> Did you experiment with other spin times?

Of course ;)

> Should it be scaled with number of CPUs?

Not to my knowledge, but we can put that as a research "todo".

> And at what point is real
> time behaviour visibly impacted? 

Well, if we did our jobs correctly, RT behavior should *never* be impacted.  
*Throughput* on the other hand... ;)

But its comes down to what I mentioned earlier. There is that threshold that 
affects the probability of timing out.  Values lower than that threshold start 
to degrade throughput.  Values higher than that have no affect on throughput, 
but may drive the cpu utilization higher which can theoretically impact tasks 
of equal or lesser priority by taking that resource away from them.  To date, 
we have not observed any real-world implications of this however.

> 
> Most likely it would be better to switch to something that is more
> absolute time, like checking RDTSC every few iteration similar to what
> udelay does. That would be at least constant time.

I agree.  We need to move in the direction of time-basis.  The tradeoff is that 
it needs to be portable, and low-impact (e.g. ktime_get() is too heavy-weight). 
 I think one of the (not-included) patches converts a nanosecond value from the 
sysctl to approximate loop-counts using the bogomips data.  This was a decent 
compromise between the non-scaling loopcounts and the heavy-weight official 
timing APIs.  We dropped it because we support older kernels which were 
conflicting with the patch. We may have to resurrect it, however..

-Greg



--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to