[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-12-31 Thread glisse at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 Marc Glisse glisse at gcc dot gnu.org changed: What|Removed |Added CC||ylow at graphlab

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-29 Thread ubizjak at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 Uroš Bizjak ubizjak at gmail dot com changed: What|Removed |Added Target||x86

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-29 Thread ubizjak at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 --- Comment #7 from Uroš Bizjak ubizjak at gmail dot com --- Fixed.

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-29 Thread vincenzo.innocente at cern dot ch
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 --- Comment #8 from vincenzo Innocente vincenzo.innocente at cern dot ch --- thanks for getting in the trunk. will be possible to back port to at least 4.8? (this issue is there till 4.4!)

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-29 Thread ysrumyan at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 Yuri Rumyantsev ysrumyan at gmail dot com changed: What|Removed |Added CC||ysrumyan at

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-29 Thread ubizjak at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 --- Comment #10 from Uroš Bizjak ubizjak at gmail dot com --- (In reply to Yuri Rumyantsev from comment #9) I assume that this fix is not good and must be reverted - I will prepare another fix for your reviewing. There are at least 2 problems:

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-29 Thread dushistov at mail dot ru
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 --- Comment #11 from Evgeniy Dushistov dushistov at mail dot ru --- (In reply to Yuri Rumyantsev from comment #9) I checked that zeroing of xmm register before conversion leads to performance slowdown on SLM (-5%) for proveded test-case. I

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-27 Thread vincenzo.innocente at cern dot ch
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 --- Comment #5 from vincenzo Innocente vincenzo.innocente at cern dot ch --- confirmed that the patch fixes the issue c++ -O2 -march=corei7-avx polyAVX.cpp time ./a.out 10358474048 2.965u 0.001s 0:02.97 99.6%0+0k 0+0io 146pf+0w

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-26 Thread hjl.tools at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 H.J. Lu hjl.tools at gmail dot com changed: What|Removed |Added CC||hjl.tools at gmail

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-26 Thread dushistov at mail dot ru
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 --- Comment #3 from Evgeniy Dushistov dushistov at mail dot ru --- Great, I tested the patch, at now pi calculation as fast as in icc, and two times faster then in clang 3.3.

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-26 Thread hjl.tools at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 H.J. Lu hjl.tools at gmail dot com changed: What|Removed |Added Attachment #30560|0 |1 is

[Bug target/57954] AVX missing vxorps (zeroing) before vcvtsi2s %edx, slow down AVX code

2013-07-25 Thread hjl.tools at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57954 H.J. Lu hjl.tools at gmail dot com changed: What|Removed |Added CC||dushistov at mail dot