Re: [PATCH, rs6000] vec_sums must define all result vector elements

David Edelsohn Fri, 21 Feb 2014 10:28:25 -0800

On Fri, Feb 21, 2014 at 12:56 PM, Bill Schmidt
<wschm...@linux.vnet.ibm.com> wrote:
> Hi,
>
> The little-endian implementation of vec_sums is incorrect.  I had
> misread the specification and thought that the fields not containing the
> result value were undefined, but in fact they are defined to contain
> zero.  My previous implementation used a vector splat to copy the field
> from BE element 3 to LE element 3.  The corrected implementation will
> use a vector shift left to move the field and fill the remaining fields
> with zeros.
>
> When I fixed this, I discovered I had also missed a use of
> gen_altivec_vsumsws, which should now use gen_altivec_vsumsws_direct
> instead.  This is fixed in this patch as well.
>
> Bootstrapped and tested on powerpc64le-unknown-linux-gnu with no
> regressions.  Bootstrap and regression test on
> powerpc64-unknown-linux-gnu is in progress.  If no big-endian
> regressions are found, is this ok for trunk?


Okay.
Thanks, David

Re: [PATCH, rs6000] vec_sums must define all result vector elements

Reply via email to