Josh Vanderhoof wrote:
> 
> Adam Wiggins <[EMAIL PROTECTED]> writes:
> 
> > Oh, they're more or less the same - the difference is that Intel's chips
> > have floating point which is comparable to their integer processing,
> > whereas AMD's K6's beat the pants off of Intel for integer, but lagged
> > way behind on floating point.  3DNow! was basically a quick fix to this
> > problem.
> 
> You're probably right, but 3dnow is still capable of being faster than
> standard instructions.  Here is how the different combinations compare
> in Mesa.  The times are cycles to transform 100 points3_general
> vertices.  (The K6-2 times are from Holger.)
> 
> K6-2 without 3dnow: 11338
> K6-2 with 3dnow:     1993
> Plain old pentium:   4742
> 
> Mesa running q3test spends a little more than 5% of its time in
> gl_x86_transform_points3_general on my Pentium (this may have changed
> slightly - I haven't profiled it lately), so the 3dnow version is
> probably around 2.5% faster if everything else is equal.  SSE (I think
> this is the current name for KNI) should be similar.
> 
> Clipping looks like the biggest bottleneck for q3test on Mesa right
> now.

This hasn't really been the case since the fxfastpath code was introduced.  A
sample profile (from a *long* time ago, but more or less still correct) is
attached.  Clipping is actually done in routines called
render_vb_triangles_TMU0_TMU1 and also in fx_tri_clip_RGBA_TMU0 (the equivalent
routine is inlined in the TMU0_TMU1 case, which is why it doesn't show up). 
Clipping is still a significant percentage, but it is not 'number 1'...
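
On the quoted 2.5% estimate for the transform routine: a rough way to check
it is an Amdahl-style calculation.  The sketch below is only that, a sketch.
It reuses the cycle counts and the "a little more than 5%" figure quoted
above, and it assumes, as the quoted text does, that everything else is equal
(the two cycle counts actually come from different CPUs).  The function name
and constants are made up for illustration.

```python
# Amdahl-style estimate: if one routine takes `fraction` of total run time
# and gets faster, how much faster does the whole program get?

def whole_program_speedup(fraction, old_cycles, new_cycles):
    """Return overall speedup when one routine's cost drops from
    old_cycles to new_cycles and the rest of the program is unchanged."""
    new_time = (1.0 - fraction) + fraction * (new_cycles / old_cycles)
    return 1.0 / new_time

# Figures quoted earlier in this message.
PENTIUM_X86_CYCLES = 4742    # plain Pentium, standard instructions
K6_2_3DNOW_CYCLES = 1993     # K6-2 with 3dnow
TRANSFORM_FRACTION = 0.05    # share of q3test time in the transform routine

speedup = whole_program_speedup(
    TRANSFORM_FRACTION, PENTIUM_X86_CYCLES, K6_2_3DNOW_CYCLES
)
print(f"overall speedup: {(speedup - 1.0) * 100:.1f}%")
```

This comes out at a few percent, the same ballpark as the quoted estimate;
the point is that a routine at 5% of run time can only ever buy back 5%.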

  %   cumulative   self              self     total
 time   seconds   seconds    calls  us/call  us/call  name
 10.03      1.75     1.75   163106    10.73    10.73  gl_3dnow_transform_points3_general_raw_fx2
  8.43      3.22     1.47    48173    30.52    30.52  render_vb_triangles_TMU0_TMU1
  7.51      4.53     1.31    48173    27.19    27.19  fx_project_clipped_vertices_TMU0_TMU1
  6.36      5.64     1.11    78776    14.09    24.82  fx_setup_full_RGBA_TMU0
  5.62      6.62     0.98   207667     4.72     4.72  fx_tri_clip_RGBA_TMU0
  5.56      7.59     0.97                             gl_x86_FX_cliptest_points4
  3.61      8.22     0.63    52804    11.93    11.93  fx_project_vertices_RGBA_TMU0
  3.44      8.82     0.60   150301     3.99     3.99  render_vb_triangles_fx_smooth_indirect
  3.10      9.36     0.54    80446     6.71    17.44  fx_setup_full_TMU0_TMU1
  2.52      9.80     0.44   163106     2.70     2.70  gl_prepare_arrays_cva
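
For anyone who wants to poke at these numbers, here is a small sketch that
ranks the rows by self time and adds up the two routines named above as
containing clipping.  The percentages are copied from the profile; the
variable names are made up for illustration.  Note that
render_vb_triangles_TMU0_TMU1 has its clipping inlined, so its self time is
not all clipping and the sum is only an upper bound.

```python
# (self-time %, routine) pairs copied from the flat profile above.
PROFILE = [
    (10.03, "gl_3dnow_transform_points3_general_raw_fx2"),
    (8.43, "render_vb_triangles_TMU0_TMU1"),
    (7.51, "fx_project_clipped_vertices_TMU0_TMU1"),
    (6.36, "fx_setup_full_RGBA_TMU0"),
    (5.62, "fx_tri_clip_RGBA_TMU0"),
    (5.56, "gl_x86_FX_cliptest_points4"),
    (3.61, "fx_project_vertices_RGBA_TMU0"),
    (3.44, "render_vb_triangles_fx_smooth_indirect"),
    (3.10, "fx_setup_full_TMU0_TMU1"),
    (2.52, "gl_prepare_arrays_cva"),
]

# Routines the message names as doing clipping (the first one also does
# other work, with the clip routine inlined into it).
CLIPPING_ROUTINES = {"render_vb_triangles_TMU0_TMU1", "fx_tri_clip_RGBA_TMU0"}

# Largest single entry by self time.
top_pct, top_name = max(PROFILE)

# Upper bound on the clipping share among the listed rows.
clip_upper_bound = sum(pct for pct, name in PROFILE if name in CLIPPING_ROUTINES)

print(f"top entry: {top_name} ({top_pct}%)")
print(f"clipping routines, upper bound: {clip_upper_bound:.2f}%")
```

The single largest entry is the transform routine, and no one clipping
routine is at the top of the list.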
 

Keith


_______________________________________________
Mesa-dev maillist  -  [EMAIL PROTECTED]
http://lists.mesa3d.org/mailman/listinfo/mesa-dev
