Re: [PATCH] D21021: [Clang][AVX512][BuiltIn]Adding intrinsics move_{sd|ss} to clang

Simon Pilgrim via cfe-commits Fri, 16 Sep 2016 12:33:12 -0700

RKSimon added a subscriber: RKSimon.

================
Comment at: lib/Headers/avx512fintrin.h:9124
@@ +9123,3 @@
+{
+  return (__m128) __builtin_ia32_movss_mask ((__v4sf) __A, (__v4sf) __B,
+               (__v4sf) __W,
----------------
delena wrote:
> please try the following:
> if (__U)
>   return __builtin_shuffle(A, B, (0, 5, 6, 7)); // maybe you need to swap A and B
> return W;
> 
> I know that the immediate code will be less optimal, but we can optimize it later.
Any update on this? I currently have a patch (D24653) that looks at removing the 
movss/movsd mask intrinsics, since we should be able to do this purely with 
generic shuffles. I can help with the optimization if necessary.
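
For reference, here is a minimal sketch of what a generic-shuffle form of the 
masked movss could look like. It is an illustration only, not the code from 
D21021 or D24653. The names v4sf/v4si, LOW_FROM_FIRST and mask_move_ss_sketch 
are hypothetical, and it uses the GCC/Clang vector extensions rather than the 
real header types. It assumes the documented _mm_mask_move_ss semantics: lane 0 
comes from B when mask bit 0 is set and from W otherwise, and lanes 1..3 always 
come from A (so the mask-off case is not simply W).

```c
/* Hypothetical stand-ins for the header's vector types (GCC/Clang extension). */
typedef float v4sf __attribute__((vector_size(16)));
typedef int   v4si __attribute__((vector_size(16)));

/* Build { lo[0], hi[1], hi[2], hi[3] } with a generic two-input shuffle.
   Indices 0..3 select from the first operand, 4..7 from the second.
   Clang (and newer GCC) provide __builtin_shufflevector; older GCC
   only has __builtin_shuffle, so fall back to that. */
#if defined(__has_builtin)
#  if __has_builtin(__builtin_shufflevector)
#    define LOW_FROM_FIRST(lo, hi) __builtin_shufflevector((lo), (hi), 0, 5, 6, 7)
#  endif
#endif
#ifndef LOW_FROM_FIRST
#  define LOW_FROM_FIRST(lo, hi) __builtin_shuffle((lo), (hi), (v4si){0, 5, 6, 7})
#endif

/* Sketch of a masked move_ss: only bit 0 of the mask matters. */
static v4sf mask_move_ss_sketch(v4sf w, unsigned char u, v4sf a, v4sf b)
{
    if (u & 1)
        return LOW_FROM_FIRST(b, a);  /* lane 0 from b, lanes 1..3 from a */
    return LOW_FROM_FIRST(w, a);      /* lane 0 from w, lanes 1..3 from a */
}
```

Note that the operands are in (B, A) order for the 0, 5, 6, 7 index pattern, 
which is the swap mentioned in the quoted suggestion. The branch on the mask 
bit is kept here for readability; a select on the low element would be the 
obvious next step.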



https://reviews.llvm.org/D21021



_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
