> From: "Takehiro Tominaga" <[EMAIL PROTECTED]>
>
> >>>>> "M" == Mathew Hendry <[EMAIL PROTECTED]> writes:
>
> M> I got Takehiro's code working (the fp stack was out of order)
> M> but it seems to be slightly slower than the original. Here's
> M> the fixed code anyway - the main loop in quantize_xrpow
>
> slow ????
> Hmm, what kind of architecture do you use?
Sorry, should have mentioned I'm using a Celery 400 / Win2k. The difference
in speed is small but consistent.
> On P6 (dynamic execution arch.), it stalls in the continuous fmul section.
> On P5 or K6-2/III, the fxch pipeline hack can't speed it up.
Maybe Acy Stapp can help out here. His original MSVC code was aimed at P6,
but I think he made some comments about changes for P5 when he posted the
patch. My gcc code was a conversion of that.
> Anyway, I am now replacing these asm routines with GOGO's more aggressive
> code. 3DNow!/SSE will bring us crazily fast encoding.
Bah, and I don't have either of those extensions... still waiting for decent
dual K7 boards to appear. ;)
-- Mat.
--
MP3 ENCODER mailing list ( http://geek.rcc.se/mp3encoder/ )