http://llvm.org/bugs/show_bug.cgi?id=17679

            Bug ID: 17679
           Summary: [Vectorizer] Implement interleaved stride
                    vectorization
           Product: libraries
           Version: trunk
          Hardware: PC
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P
         Component: Loop Optimizer
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected]
    Classification: Unclassified

After implementing the non-interleaved vectorization, detect interleaved
patterns, such as:

for (i..N/3) {
 a[3*i] = b[3*i] + I;
 a[3*i+1] = b[3*i+1] + J;
 a[3*i+2] = b[3*i+2] + K;
}

Where you can unroll this into:

for (i..N/3) {
 a[3*i] = b[3*i] + I;
 a[3*i+3] = b[3*i+3] + I;
 a[3*i+6] = b[3*i+6] + I;
 a[3*i+9] = b[3*i+9] + I;

 a[3*i+1] = b[3*i+1] + J;
 a[3*i+4] = b[3*i+4] + J;
 a[3*i+7] = b[3*i+7] + J;
 a[3*i+10] = b[3*i+10] + J;

 a[3*i+2] = b[3*i+2] + K;
 a[3*i+5] = b[3*i+5] + K;
 a[3*i+8] = b[3*i+8] + K;
 a[3*i+11] = b[3*i+11] + K;
}

And use interleaved reads/writes, for example ARM's VLDN/VSTN.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
LLVMbugs mailing list
[email protected]
http://lists.cs.uiuc.edu/mailman/listinfo/llvmbugs

Reply via email to