Am 02.02.2006 um 13:23 schrieb Gustaf Neumann:

could not resist to try this on our p5 production system under modest load (64bit, linux, lpar with 25 processors, 8 dual-core with ibms version of hyperthreading)
processor       : 25
cpu             : POWER5 (gr)
clock           : 1904.448000MHz
revision        : 2.3

Urgs!?? What is this for a monster-machine???

The timings what you get are what I expected on a multi-cpu box.
However all our single-cpu boxes are WAY slower with ckalloc
then with the regular malloc.
Do you happen to have a single-cpu box where you can try this out?

What I'm trying to understand is: is this pattern regular or not?

If yes, then the default tcl allocator used for threading builds
is sub-optimal for general builds and has to be runtime-dependent
(use it for multi-cpu boxes and not on single-cpu). But before I
go the tough way of getting this done in Tcl (I will definitely
experience fierce opposition from "works-for-me" people...) I'd
better collect some very hard ammunition...

If not, then something is wrong with ALL our single-box machines
which is very, very hard to believe...

Thanks,
Zoran




Reply via email to