> From: "Takehiro Tominaga" <[EMAIL PROTECTED]>
>
> >>>>> "M" == Mathew Hendry <[EMAIL PROTECTED]> writes:
>
>     M> I got Takehiro's code working (the fp stack was out of order)
>     M> but it seems to be slightly slower than the original. Here's
>     M> the fixed code anyway - the main loop in quantize_xrpow
>
> slow ????
> umm,,,, What kind of architechture you use ?

Sorry, should have mentioned I'm using a Celery 400 / Win2k. The difference
in speed is small but consistent.

> if P6 (dynamic execution arch.), it stalls continuous fmul section.
> if P5 or K6-2/III, it can't speed up by fxch pipeline hack.

Maybe Acy Stapp can help out here. His original MSVC code was aimed at P6,
but I think he made some comments about changes for P5 when he posted the
patch. My gcc code was a conversion of that.

> Anyway, I am now replacing these asm routines into GOGO's more aggressive
> code. 3DNow!/SSE will bring us a crazyly fast encoding.

Bah, and I don't have either of those extensions... still waiting for decent
dual K7 boards to appear. ;)

-- Mat.


--
MP3 ENCODER mailing list ( http://geek.rcc.se/mp3encoder/ )

Reply via email to