On Mon, 2008-05-19 at 20:32 +0200, Thomas Hellström wrote: > Keith Packard wrote: > > On Mon, 2008-05-19 at 05:09 -0700, Keith Whitwell wrote: > > > > > >> I > >> think the latter is the significant result -- none of these experiments > >> in memory management significantly change the command stream the > >> hardware has to operate on, so what we're varying essentially is the > >> CPU behaviour to acheive that command stream. And it is in CPU usage > >> where GEM (and Keith/Eric's now-abandoned TTM driver) do significantly > >> dissapoint. > >> > > > > Your GEM results do not match mine; perhaps we're running different > > kernels? Anything older than 2.6.24 won't be using clflush and will > > instead use wbinvd, a significant performance impact. Profiling would > > show whether this is the case. > > > > I did some fairly simple measurements using openarena and enemy > > territory. Kernel version 2.6.25, CPU 1.3GHz Pentium M, 915GMS with the > > slowest possible memory. I'm afraid I don't have a working TTM > > environment at present; I will try to get that working so I can do more > > complete comparisons. > > > > fps real user kernel > > glxgears classic: 665 > > glxgears GEM: 889 > > openareana classic: 17.1 59.19 37.13 1.80 > > openarena GEM: 24.6 44.06 25.52 5.29 > > enemy territory classic: 9.0 382.13 226.38 11.51 > > enemy territory GEM: 15.7 212.80 121.72 40.50 > > > > > Keith, > > The GEM timings were done with 2.6.25, except on the i915 system texdown > timings which used 2.6.24. > Indeed, Michel reported much worse GEM figures with 2.6.23.
We clearly need to find a way to generate reproducible benchmark data.
Here's what I'm running:
kernel:
commit 4b119e21d0c66c22e8ca03df05d9de623d0eb50f
Author: Linus Torvalds <[EMAIL PROTECTED]>
Date: Wed Apr 16 19:49:44 2008 -0700
Linux 2.6.25
(there's a patch to export shmem_file_setup on top of this)
mesa (from git://people.freedesktop.org/~keithp/mesa):
commit 8b49cc104dd556218fc769178b96f4a8a428d057
Author: Keith Packard <[EMAIL PROTECTED]>
Date: Sat May 17 23:34:47 2008 -0700
[intel-gem] Don't calloc reloc buffers
Only a few relocations are typically used, so don't clear
the
whole thing.
drm (from git://people.freedesktop.org/~keithp/drm):
commit 6e46a3c762919af05fcc6a08542faa7d185487a1
Author: Eric Anholt <[EMAIL PROTECTED]>
Date: Mon May 12 15:42:20 2008 -0700
[GEM] Update testcases for new API.
xf86-video-intel (from git://people.freedesktop.org/~keithp/xf86-video-intel):
commit c81050c0058e32098259b5078515807038beb7d6
Merge: 9c9a5d0... e9532f3...
Author: Keith Packard <[EMAIL PROTECTED]>
Date: Sat May 17 23:26:14 2008 -0700
Merge commit 'origin/master' into drm-gem
> Your figures look a bit odd. Is glxgears classic CPU-bound? If not, why
> does it give a significantly slower framerate than
> glxgears GEM?
glxgears uses 40% of the CPU in both classic and gem. Note that the gem
version takes about 20 seconds to reach a steady state -- the gem driver
isn't clearing the gtt actively and so glxgears gets far ahead of the
gpu.
My theory is that this shows that using cache-aware copies from a single
static batch buffer (as gem does now) improves cache performance and
write bandwidth.
--
[EMAIL PROTECTED]
signature.asc
Description: This is a digitally signed message part
------------------------------------------------------------------------- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
-- _______________________________________________ Dri-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dri-devel
