Re: 7.0 CPU and Memory Performance

Jason Evans Wed, 13 Aug 2008 15:38:11 -0700

Kris Kennaway wrote:

Tim Traver wrote:
And here is the run of the ubench.5.4 binary:
FreeBSD 7.0 - CPU 139,623 - MEM - 207,180
And a rerun of the FreeBSD 7.0 ubench making sure there is absolutelyno activity on the box
FreeBSD 7.0 - CPU 200,562 - MEM - 107,695
That run is a little better than the previous one, but there seems tostill be quite a difference in the memory tests...
Does that show anything ????
It shows that if there is a difference it is probably in userland, notthe kernel. The obvious guess is the new malloc in 7.0. As for whetherit indicates a bug, someone would have to look more closely at whatubench does. The author's description of his benchmark doesn't inspireconfidence: it does "rather senseless memory allocation and memory tomemory copying operations for another 3 mins concurrently using severalprocesses".

The ubench memory benchmark operates almost entirely on 1024B buffers,which is nearly worst case for jemalloc. Also, its memory usefluctuates wildly, in a pattern that causes a lot of dirty page flushingand chunk map/unmap activity. That is where most of the difference is;jemalloc is more aggressive/effective in returning pages to the VM thanis phkmalloc. In order to verify the cause of the performancedifference, I ran ubench (on an 8-current system) withMALLOC_OPTIONS=7F6K (avoid flushing dirty pages, and use 64-MiB chunksin order to avoid repeatedly mapping/unmapping chunks), and the ubenchmemory benchmark sped up by ~51%. With the default configuration,jemalloc was ~13% slower than phkmalloc, but with 7F6K it was ~31%faster than phkmalloc.

On possible factor for stock FreeBSD 7.0 is a scalability issue that IMFC'ed a fix for in r176922 on 7 March (shortly after the 7.0 release).And, there's a non-trivial overall performance improvement that I'mplanning to MFC this week.

I encourage you to find some better way of testing memory performancethan ubench. Generic malloc benchmarking is *hard*. The most effectiveapproach for someone not specifically interested in allocators is tobenchmark the actual applications that will be run in production. Ifyou find that jemalloc performs poorly in such circumstances, please letme know the details so that I can look into possible improvements.


Thanks,
Jason
_______________________________________________
[email protected] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-performance
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: 7.0 CPU and Memory Performance

Reply via email to