Re: [PATCH] powerpc: Add POWER9 copy_page() loop

2018-01-27 Thread Michael Ellerman
Anton Blanchard writes: > From: Anton Blanchard > > Add a POWER9 optimised copy_page() loop. This loop uses the new D form > vector loads and stores, and uses dcbz to pre zero the destination. > ... > + > +#ifdef CONFIG_ALTIVEC > + mflrr0 > + std

Re: [PATCH] powerpc: Add POWER9 copy_page() loop

2017-04-02 Thread Benjamin Herrenschmidt
On Mon, 2017-04-03 at 10:54 +1000, Anton Blanchard wrote: > > > Good idea, I hadn't thought of embedding it all in a feature > > > section.   > > > > It may not work currently because you get those ftr_alt_97 relocation > > errors with the "else" parts because relative branches to other code > >

Re: [PATCH] powerpc: Add POWER9 copy_page() loop

2017-04-02 Thread Anton Blanchard
Hi Nick, > > Good idea, I hadn't thought of embedding it all in a feature > > section. > > It may not work currently because you get those ftr_alt_97 relocation > errors with the "else" parts because relative branches to other code > need to be direct and I think reachable from both places. I

Re: [PATCH] powerpc: Add POWER9 copy_page() loop

2017-03-20 Thread Nicholas Piggin
On Tue, 21 Mar 2017 15:01:03 +1100 Anton Blanchard wrote: > Hi Nick, > > > I've got a patch that makes alternate feature patching a bit > > more flexible and not hit relocation limits when using big "else" > > parts. I was thinking of doing something like > > > >

Re: [PATCH] powerpc: Add POWER9 copy_page() loop

2017-03-20 Thread Anton Blanchard
Hi Nick, > I've got a patch that makes alternate feature patching a bit > more flexible and not hit relocation limits when using big "else" > parts. I was thinking of doing something like > > _GLOBAL_TOC(copy_page) > BEGIN_FTR_SECTION_NESTED(50) > #include "copypage_power9.S" >

Re: [PATCH] powerpc: Add POWER9 copy_page() loop

2017-03-20 Thread Nicholas Piggin
On Tue, 21 Mar 2017 10:40:46 +1100 Anton Blanchard wrote: > From: Anton Blanchard > > Add a POWER9 optimised copy_page() loop. This loop uses the new D form > vector loads and stores, and uses dcbz to pre zero the destination. > > A few questions: > > - I'm

[PATCH] powerpc: Add POWER9 copy_page() loop

2017-03-20 Thread Anton Blanchard
From: Anton Blanchard Add a POWER9 optimised copy_page() loop. This loop uses the new D form vector loads and stores, and uses dcbz to pre zero the destination. A few questions: - I'm using a nested feature section, but that is going to get unwieldy at some stage. It would