Eero Pajarre <[EMAIL PROTECTED]> writes:
> I successfully compiled the x86 assembly code into my
> copy of Mesa on win95. I did this using the nasm assembler -
> rest of my compilation was done using MS VC++ 5.0.
>
> I first tried to compile the assembly using cygwin gcc (gas) to produce
> the MS compatible object file, the compilation was ok but the
> resulting library certainly was not.
>
>
> When compiling with NASM I met one typo in the assembly source,
> JZ ctp4_finish was missing the "NEAR" tag. Apparently this is
> because x86a.S.m4 around line 1350 has JZ LLBL(ctp4_finish)
> the correct form seems to be JZ(LLBL(ctp4_finish))
>
>
> When looking at the compiled program Vtune 3.0 got
> fits from sequences like:
> shl ECX , 4
> jz near p2mgm_finish
>
> It complained that the first instruction sets some of the flags
> (but not all), while the second was using the flags and claimed that
> this sequence would get a 12-16 cycle penalty on PPro and P2
>
> As this sequency is not inside the transformation inner loop the actual
> effect
> was not really visible in my tests (and this might of course be
> an error from VTUNE). In any case it is possible to
> get rid of the problem by placing for example TEST ECX,ECX between
> the offending instructions.
Thanks Eero. I've checked in your changes.
Josh
_______________________________________________
Mesa-dev maillist - [EMAIL PROTECTED]
http://lists.mesa3d.org/mailman/listinfo/mesa-dev