http://llvm.org/bugs/show_bug.cgi?id=17679
Bug ID: 17679
Summary: [Vectorizer] Implement interleaved stride vectorization
Product: libraries
Version: trunk
Hardware: PC
OS: Linux
Status: NEW
Severity: normal
Priority: P
Component: Loop Optimizer
Assignee: [email protected]
Reporter: [email protected]
CC: [email protected]
Classification: Unclassified
Once non-interleaved stride vectorization is implemented, detect interleaved
access patterns, such as:
for (i..N/3) {
a[3*i] = b[3*i] + I;
a[3*i+1] = b[3*i+1] + J;
a[3*i+2] = b[3*i+2] + K;
}
This can be unrolled by four, with the accesses regrouped by offset (i now
advances by 4 per iteration):
for (i..N/3, i += 4) {
a[3*i] = b[3*i] + I;
a[3*i+3] = b[3*i+3] + I;
a[3*i+6] = b[3*i+6] + I;
a[3*i+9] = b[3*i+9] + I;
a[3*i+1] = b[3*i+1] + J;
a[3*i+4] = b[3*i+4] + J;
a[3*i+7] = b[3*i+7] + J;
a[3*i+10] = b[3*i+10] + J;
a[3*i+2] = b[3*i+2] + K;
a[3*i+5] = b[3*i+5] + K;
a[3*i+8] = b[3*i+8] + K;
a[3*i+11] = b[3*i+11] + K;
}
Then use interleaved loads/stores, for example ARM's VLDn/VSTn (VLD3/VST3 for
a stride of three).
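
As a rough illustration of the transformation requested here, the following is a
minimal sketch in portable C, not vectorizer code. It assumes float elements, a
vector width of 4 lanes, and N divisible by 12 (no remainder loop). The names
add_scalar, add_interleaved, load3, store3, STRIDE and VF are hypothetical, and
add_i/add_j/add_k stand in for I/J/K above. load3 and store3 only model the
de-interleaving behaviour of a VLD3/VST3-style instruction with scalar code.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed for this sketch: 3 interleaved streams, 4 lanes per vector. */
enum { STRIDE = 3, VF = 4 };

/* Scalar reference: the original loop from the report. */
void add_scalar(float *a, const float *b, size_t n,
                float add_i, float add_j, float add_k) {
    assert(n % STRIDE == 0);
    for (size_t i = 0; i < n / STRIDE; ++i) {
        a[3 * i]     = b[3 * i]     + add_i;
        a[3 * i + 1] = b[3 * i + 1] + add_j;
        a[3 * i + 2] = b[3 * i + 2] + add_k;
    }
}

/* Models a de-interleaving load: element 3*l+k of p goes to lane l of vk. */
static void load3(const float *p, float v0[VF], float v1[VF], float v2[VF]) {
    for (int l = 0; l < VF; ++l) {
        v0[l] = p[3 * l];
        v1[l] = p[3 * l + 1];
        v2[l] = p[3 * l + 2];
    }
}

/* Models the matching interleaving store (inverse of load3). */
static void store3(float *p, const float v0[VF], const float v1[VF],
                   const float v2[VF]) {
    for (int l = 0; l < VF; ++l) {
        p[3 * l]     = v0[l];
        p[3 * l + 1] = v1[l];
        p[3 * l + 2] = v2[l];
    }
}

/* Interleaved form: each iteration handles STRIDE * VF = 12 elements. */
void add_interleaved(float *a, const float *b, size_t n,
                     float add_i, float add_j, float add_k) {
    assert(n % (STRIDE * VF) == 0); /* sketch only: no remainder loop */
    for (size_t base = 0; base < n; base += STRIDE * VF) {
        float v0[VF], v1[VF], v2[VF];
        load3(b + base, v0, v1, v2);
        for (int l = 0; l < VF; ++l) { /* one vector add per stream */
            v0[l] += add_i;
            v1[l] += add_j;
            v2[l] += add_k;
        }
        store3(a + base, v0, v1, v2);
    }
}
```

Calling both functions on the same 24-element input should give identical
output arrays; on a target with interleaved load/store instructions, each
load3/store3 call and each per-stream add would ideally map to a single vector
instruction.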