All, the attached patch file contains:

1.)                Updates {|v}movss/{|v}movsd (AVX/SSE) conforming to the isa 
for both SSE2 and AVX code.

2.)                Changes in copy detection and side effect detection for LRA 
and EBO.

In doing this, I updated the auto translation back to originating forms for 
movss, movsd, vmovss, vmovd so that intrinsics will map properly for all of 
x8664.
Also I updated the copy expansion code to use movaps under the same target 
guards that we did in the auto translation remap before.  The change produces 
correct code on BD and all SSE enabled x8664 architectures for expansion 
patterns and intrinsics.  The expanded SSE movs{s|d} instructions have been 
replaced with movdq in the generic path as it correctly fits the motif as a 
single source move.

These mov instructions are really merge operations that need to preserve a data 
path.

These changes pass the following:
a.)           No compile time failure for x86 build.
b.)          The gcc regression test suite on x86/Linux with no new failures(GH 
and BD).
c.)           The SPEC2006 test suite at with current AMD 1 copy config at both 
base and peak(GH and BD).

Thx,

m

Attachment: simd_move_update.p
Description: simd_move_update.p

------------------------------------------------------------------------------
5 Ways to Improve & Secure Unified Communications
Unified Communications promises greater efficiencies for business. UC can 
improve internal communications as well as offer faster, more efficient ways
to interact with customers and streamline customer service. Learn more!
http://www.accelacomm.com/jaw/sfnl/114/51426253/
_______________________________________________
Open64-devel mailing list
Open64-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/open64-devel

Reply via email to