Re: Efficient memcpy()/memmove() for G2/G3 cores...

2008-09-04 Thread Gunnar Von Boehn
Hi Steve, > I have personally optimized memcpy for power4/5/6 and they are all > different. There are dozens of different PPC implementations from > different manufacturers and design, every one is different! With painful > negotiation I was able to get the --with-cpu= framework added to glibc > b

Re: Efficient memcpy()/memmove() for G2/G3 cores...

2008-09-04 Thread Gunnar Von Boehn
Hi David, Regarding your testcase. I think we all agree with you that improving the performance for PPC is a noble quest and we should all try do improve the performance where possible. Regarding the 5200B and 5221 CPUs. As we all know the 5200B is a G2 PowerPC from Freescale. The factor for

Re: Efficient memcpy()/memmove() for G2/G3 cores...

2008-09-04 Thread Gunnar Von Boehn
Steve, I think we should be grateful for people being interested in improving performance for PPC, and we should not bash them. The proposal to optimize the memcopy for the 5200 is good. Steve, you said that you've heard about the 5200.. Maybe I can refresh your memory: I did send you an optimi

Re: [RFC 1/3] powerpc: __copy_tofrom_user tweaked for Cell

2008-06-27 Thread Gunnar von Boehn
Gunnar von Boehn writes: > Interesting points. > Can you help me to understand where the negative effect of DCBZ does come > from? In my experience, dcbz slows down the hot-cac

Re: [RFC 1/3] powerpc: __copy_tofrom_user tweaked for Cell

2008-06-23 Thread Gunnar von Boehn
--- On Fri, 6/20/08, Gunnar von Boehn <[EMAIL PROTECTED]> wrote: > How important is best performance for the unaligned copy > to/from uncacheable memory? > The challenge of the CELL chip is that X-form of the shift > instructions a

Re: [Cbe-oss-dev] [RFC 1/3] powerpc: __copy_tofrom_user tweaked for Cell

2008-06-20 Thread Gunnar von Boehn
Gunnar von Boehn writes: > The "regular" code was much slower for the normal case and has a special > version for the 4K optimized case. That's a slightly inaccurate view... The reason for havi

Re: [RFC 0/3] powerpc: memory copy routines tweaked for Cell

2008-06-20 Thread Gunnar von Boehn
routines tweaked for Cell Gunnar von Boehn writes

Re: [RFC 1/3] powerpc: __copy_tofrom_user tweaked for Cell

2008-06-20 Thread Gunnar von Boehn
.com> To Arnd Bergmann <[EMAIL PROTECTED]>, 19/06/2008 18:13 Gunnar von Boehn/Germany/Contr/[

Re: [RFC 1/3] powerpc: __copy_tofrom_user tweaked for Cell

2008-06-19 Thread Gunnar von Boehn
cc Mark Nelson <[EMAIL PROTECTED]>, [EMAIL PROTECTED], Gunnar von Boehn/Germany/Contr/[EMAIL PROTECTED],

Re: [RFC 0/3] powerpc: memory copy routines tweaked for Cell

2008-06-19 Thread Gunnar von Boehn
Hi Arnd, I have no results for P5/P6, but I did some tests on JS21 aka PPC-970. On PPC-970 the CELL memcpy is faster than the current Linux routine. This becomes really visible when you really copy memory-to-memory and are not only working in the 2ndlevelcache. Kind regards Gunnar von Boehn