On Fri, Feb 21, 2014 at 12:56 PM, Bill Schmidt <wschm...@linux.vnet.ibm.com> wrote:
> Hi,
>
> The little-endian implementation of vec_sums is incorrect.  I had
> misread the specification and thought that the fields not containing the
> result value were undefined, but in fact they are defined to contain
> zero.  My previous implementation used a vector splat to copy the field
> from BE element 3 to LE element 3.  The corrected implementation will
> use a vector shift left to move the field and fill the remaining fields
> with zeros.
>
> When I fixed this, I discovered I had also missed a use of
> gen_altivec_vsumsws, which should now use gen_altivec_vsumsws_direct
> instead.  This is fixed in this patch as well.
>
> Bootstrapped and tested on powerpc64le-unknown-linux-gnu with no
> regressions.  Bootstrap and regression test on
> powerpc64-unknown-linux-gnu is in progress.  If no big-endian
> regressions are found, is this ok for trunk?

Okay.

Thanks, David
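
For readers following the thread, the result layout described in the quoted message can be modeled in plain scalar C. This is only an illustrative sketch, not the GCC patch and not AltiVec code: the names v4si_model, saturate_s32, and model_vec_sums are hypothetical, and the model assumes programmer-visible element numbering, where the sum lands in element 3 and elements 0 through 2 are zero. The saturation step follows the usual description of the vsumsws instruction rather than anything stated in the message.

```c
#include <stdint.h>

/* Hypothetical stand-in for a vector of four signed 32-bit words. */
typedef struct {
    int32_t w[4];
} v4si_model;

/* Clamp a 64-bit intermediate sum to the signed 32-bit range. */
static int32_t saturate_s32(int64_t x)
{
    if (x > INT32_MAX)
        return INT32_MAX;
    if (x < INT32_MIN)
        return INT32_MIN;
    return (int32_t)x;
}

/* Scalar model of the vec_sums result layout (an assumption-based
   sketch): element 3 holds the saturated sum of all four words of a
   plus word 3 of b; elements 0..2 are defined to be zero, not
   undefined. */
static v4si_model model_vec_sums(v4si_model a, v4si_model b)
{
    v4si_model r = { { 0, 0, 0, 0 } };
    int64_t sum = (int64_t)b.w[3];

    for (int i = 0; i < 4; i++)
        sum += a.w[i];

    r.w[3] = saturate_s32(sum);
    return r;
}
```

A splat would leave the sum in every element, which is why it does not match this layout; only one element may hold the result and the rest must be zero.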