Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-06-10 Thread Prathamesh Kulkarni
On 7 June 2016 at 14:07, Ramana Radhakrishnan wrote: >>> Please find the updated patch attached. >>> It passes testsuite for arm-none-linux-gnueabi, arm-none-linux-gnueabihf and >>> arm-none-eabi. >>> However the test-case added in the patch (neon-vect-div-1.c) fails to

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-06-07 Thread Ramana Radhakrishnan
>> Please find the updated patch attached. >> It passes testsuite for arm-none-linux-gnueabi, arm-none-linux-gnueabihf and >> arm-none-eabi. >> However the test-case added in the patch (neon-vect-div-1.c) fails to >> get vectorized at -O2 >> for armeb-none-linux-gnueabihf. >> Charles suggested me

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-06-07 Thread Prathamesh Kulkarni
On 30 May 2016 at 13:52, Prathamesh Kulkarni wrote: > On 23 May 2016 at 14:59, Prathamesh Kulkarni > wrote: >> On 5 February 2016 at 18:40, Prathamesh Kulkarni >> wrote: >>> On 4 February 2016 at

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-05-30 Thread Prathamesh Kulkarni
On 23 May 2016 at 14:59, Prathamesh Kulkarni wrote: > On 5 February 2016 at 18:40, Prathamesh Kulkarni > wrote: >> On 4 February 2016 at 16:31, Ramana Radhakrishnan >> wrote: >>> On Sun, Jan 17, 2016 at

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-05-23 Thread Prathamesh Kulkarni
On 5 February 2016 at 18:40, Prathamesh Kulkarni wrote: > On 4 February 2016 at 16:31, Ramana Radhakrishnan > wrote: >> On Sun, Jan 17, 2016 at 9:06 AM, Prathamesh Kulkarni >> wrote: >>> On 31 July 2015

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-02-05 Thread Prathamesh Kulkarni
On 4 February 2016 at 16:31, Ramana Radhakrishnan wrote: > On Sun, Jan 17, 2016 at 9:06 AM, Prathamesh Kulkarni > wrote: >> On 31 July 2015 at 15:04, Ramana Radhakrishnan >> wrote: >>> >>> >>> On

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-02-04 Thread Ramana Radhakrishnan
On Sun, Jan 17, 2016 at 9:06 AM, Prathamesh Kulkarni wrote: > On 31 July 2015 at 15:04, Ramana Radhakrishnan > wrote: >> >> >> On 29/07/15 11:09, Prathamesh Kulkarni wrote: >>> Hi, >>> This patch tries to implement division with

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2016-01-17 Thread Prathamesh Kulkarni
On 31 July 2015 at 15:04, Ramana Radhakrishnan wrote: > > > On 29/07/15 11:09, Prathamesh Kulkarni wrote: >> Hi, >> This patch tries to implement division with multiplication by >> reciprocal using vrecpe/vrecps >> with -funsafe-math-optimizations and

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2015-07-31 Thread Charles Baylis
On 31 July 2015 at 10:34, Ramana Radhakrishnan ramana.radhakrish...@foss.arm.com wrote: I've tried this in the past and never been convinced that 2 iterations are enough to get to stability with this given that the results are only precise for 8 bits / iteration. Thus I've always believed you

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2015-07-31 Thread Ramana Radhakrishnan
On 29/07/15 11:09, Prathamesh Kulkarni wrote: Hi, This patch tries to implement division with multiplication by reciprocal using vrecpe/vrecps with -funsafe-math-optimizations and -freciprocal-math enabled. Tested on arm-none-linux-gnueabihf using qemu. OK for trunk ? Thank you,

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2015-07-30 Thread Prathamesh Kulkarni
On 29 July 2015 at 16:03, Kyrill Tkachov kyrylo.tkac...@arm.com wrote: Hi Prathamesh, This is probably not appropriate for -Os optimisation. And for speed optimisation I imagine it can vary a lot on the target the code is run. Do you have any benchmark results for this patch? Hi Kyrill,

[ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2015-07-29 Thread Prathamesh Kulkarni
Hi, This patch tries to implement division with multiplication by reciprocal using vrecpe/vrecps with -funsafe-math-optimizations and -freciprocal-math enabled. Tested on arm-none-linux-gnueabihf using qemu. OK for trunk ? Thank you, Prathamesh 2015-07-28 Prathamesh Kulkarni

Re: [ARM] implement division using vrecpe/vrecps with -funsafe-math-optimizations

2015-07-29 Thread Kyrill Tkachov
Hi Prathamesh, This is probably not appropriate for -Os optimisation. And for speed optimisation I imagine it can vary a lot on the target the code is run. Do you have any benchmark results for this patch? Thanks, Kyrill On 29/07/15 11:09, Prathamesh Kulkarni wrote: Hi, This patch tries to