My observations show that the results of the benchmark is heavily dependent on the amount of L2 cache.



Processor OS/Type VM L2 Cache Results Description
   
--------------------------------------------------------------------------------------------------
Intel Core2 Duo 2.0 32-bit VMware 4096 KB 1:43s Laptop AMD Opteron 165 1.8 64-bit 1024 KB 3:08s Home Desktop AMD Opteron 165 1.8 32-bit 1024 KB 2:21s Home Desktop AMD X2 4400+ 64-bit 512 KB 6:55s Work Desktop AMD X2 4400+ 32-bit 512 KB 6:27s Work Desktop Intel Xeon(TM) Core2 2.80GHz 32-bit 2048 KB 2:39s Work Server X-Series Intel Xeon(TM) 3.00GHz Blade 32-bit VMware 1024 KB 3:48s Work Server VM Bladecenter

The ones running on AMD chips were all running RHEL5. The servers (Core2 and Blade) were running RHEL4. And, the laptop was running Kubuntu 7.10 under VMware on WinXP. It appears that running in a virtualized environment has little effect. Note that the X2 (which is faster) performed worse than the Opteron. However, the Opteron has twice the L2 cache. The laptop has the most L2 cache and beat them all significantly. My thinking is that because of the large memory footprint this benchmark makes heavy use of L2 cache.

Richard Lynch
WVNET


John Summerfield wrote:
Pedro Espinoza wrote:
Thank you, Tony.

I just compiled with -m32, yes, it made a difference: from 8m40s to
2m51s, on that box. Is there any way to effect globally such that all
calls made to qsort() make use of 32bit stuff.

This benchmark is all very interesting in an academic sense, but I;m sceptical that it represents anyone's workload.

Nor does it do anything to exploit 64-bit capabilities. If you have lots of RAM then you stand to gain considerably from the 64-bit architecture; the CPUs handle it much better.

I don't know whether it's related, but I've misguidedly been trying to repair my 64-bit Fedora 9A system with a 32-bit F8 rescue disk. It took around 30 minutes to get it running ready to use, whereas once I got the equivalent 64-bit media it was all over in a minute or two. The system has 2 Gbytes of RAM.

This "benchmark" tests very little, and the measurements might be useful as part of a model to estimate a system's performance, but it's of little use in deciding how to run a complex workload.

For that you need a benchmark that is typical of what you really do, that actually runs Perl scripts. If you load up databases then your benchmark must include that activity, either with the same version (for commercial such as Oracle) or with the version matching your OS (for free such as Postgresql).

You need to measure the _system_ performance, that is the hardware and the software both, in configurations you use or that you might use.

Meanwhile, here are some more numbers:
+ for o in s 1 2 3
+ cc sorttest.c -Os -lm -o sorttest
+ ./sorttest 20

real    1m17.105s
user    1m16.270s
sys     0m0.671s
+ for o in s 1 2 3
+ cc sorttest.c -O1 -lm -o sorttest
+ ./sorttest 20

real    1m16.831s
user    1m16.027s
sys     0m0.605s
+ for o in s 1 2 3
+ cc sorttest.c -O2 -lm -o sorttest
+ ./sorttest 20

real    1m16.941s
user    1m16.241s
sys     0m0.628s
+ for o in s 1 2 3
+ cc sorttest.c -O3 -lm -o sorttest
+ ./sorttest 20

real    1m16.821s
user    1m16.072s
sys     0m0.623s
++ echo -ne '\033]0;[EMAIL PROTECTED]:~'

I can't test 32-bit on this system, I don't have the libraries installed.
+ cc sorttest.c -Os -lm -o sorttest
+ ./sorttest 40

real    2m32.955s
user    2m31.507s
sys     0m1.181s







--


_______________________________________________
rhelv5-list mailing list
rhelv5-list@redhat.com
https://www.redhat.com/mailman/listinfo/rhelv5-list

Reply via email to