Keith Whitwell <[EMAIL PROTECTED]> writes:
> This hasn't really been the case since the fxfastpath code was introduced. A
> sample profile (from a *long* time ago, but more or less still correct) is
> attached. Clipping is actually done in routines called called
> render_vb_triangles_TMU0_TMU1 and also in fx_tri_clip_RGBA_TMU0 (the equivalent
> routine is inlined in the TMU0_TMU1 case, which is why it doesn't show up).
> Clipping is still a significant percentage, but it is not 'number 1'...
>
> % cumulative self self total
> time seconds seconds calls us/call us/call name
>
> 10.03 1.75 1.75 163106 10.73 10.73
> gl_3dnow_transform_points3_general_raw_fx2
> 8.43 3.22 1.47 48173 30.52 30.52
> render_vb_triangles_TMU0_TMU1
> 7.51 4.53 1.31 48173 27.19 27.19
That's a lot better than I would have expected (nice job, Keith!).
SSE transformation functions may be a larger improvement than I
thought. I still don't think it will be 10% though.
Josh
_______________________________________________
Mesa-dev maillist - [EMAIL PROTECTED]
http://lists.mesa3d.org/mailman/listinfo/mesa-dev