[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-04-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 --- Comment #12 from Uroš Bizjak --- Implemented also for x86.

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-04-18 Thread cvs-commit at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 --- Comment #11 from CVS Commits --- The master branch has been updated by Uros Bizjak : https://gcc.gnu.org/g:95b99e47f4f2df2d0c5680f45e3ec0a3170218ad commit r14-47-g95b99e47f4f2df2d0c5680f45e3ec0a3170218ad Author: Uros Bizjak Date: Tue

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-03-09 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 Uroš Bizjak changed: What|Removed |Added Attachment #54607|0 |1 is obsolete|

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-03-09 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 --- Comment #9 from Uroš Bizjak --- (In reply to Hongtao.liu from comment #8) > I'm thinking of something like below so it can be matched both by > expand_vselect_vconcat in ix86_expand_vec_perm_const_1 and patterns created > by

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-03-08 Thread crazylht at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 --- Comment #8 from Hongtao.liu --- (In reply to Uroš Bizjak from comment #7) > Created attachment 54607 [details] > Proposed patch > > Patch in testing. > > Attached patch produces (-O2 -msse4.1): > > f: > subq$24, %rsp >

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-03-08 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 --- Comment #7 from Uroš Bizjak --- Created attachment 54607 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54607=edit Proposed patch Patch in testing. Attached patch produces (-O2 -msse4.1): f: subq$24, %rsp xorl

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-02-20 Thread crazylht at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 --- Comment #6 from Hongtao.liu --- Yes, insertps can select any element from src and insert into any place of the dest. under sse4.1, x86 can generate vinsertps xmm0, xmm1, xmm0, 64 # xmm0 = xmm0[1],xmm1[1,2,3]

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-02-18 Thread ubizjak at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 Uroš Bizjak changed: What|Removed |Added CC||crazylht at gmail dot com --- Comment #5

[Bug target/94908] Failure to optimally optimize certain shuffle patterns

2023-02-17 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94908 Andrew Pinski changed: What|Removed |Added Severity|normal |enhancement See Also|