Re: Sandybridge addmul_N challenge

2012-02-27 Thread Torbjorn Granlund
Torbjorn Granlund writes: carry-in lo in r14 carry-in hi in rcx mov 0(up), %rax mul v1 mov 8(rp), %r8 add %rax, %r8 mov %rdx, %r9 adc $0, %r9 mov 8(up), %rax mul v0 add

New "fastsse" assembly

2012-02-27 Thread Torbjorn Granlund
I pushed new x86_64 assembly making use of 128-bit instructions working on xmm registers. While all x86_64 processors probably support the instructions used, some have less throughput using these than when using plain 64-bit instructions. The idea is to include these just before "x86_64" in the m

Re: Problem with the mp_set_memory_functions interface

2012-02-27 Thread Niels Möller
Any thoughts on this problem with the custom memory allocation functions? ni...@lysator.liu.se (Niels Möller) writes: > I find this part of the interface, > > : The REALLOCATE_FUNCTION parameter OLD_SIZE and the FREE_FUNCTION > : parameter SIZE are passed for convenience, but of course they