It would be better to combine this asm enable with the corresponding asm patch
itself. I have pushed copy_cnt8 and enabled only that for now.
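
For readers unfamiliar with how a single block size can be "enabled" on its own, here is a minimal sketch of a function-pointer dispatch table. It is not the x265 source: the names Primitives, setupPrimitives, BLK_* and copy_cnt_8_fast are hypothetical, and the assumed behaviour of copy_cnt (copy a square block and return the number of nonzero values) is an assumption, not something stated in this thread.

```cpp
#include <cstdint>

// Assumed semantics: copy a size x size block from a strided source into a
// packed destination and return how many copied values are nonzero.
template<int size>
uint32_t copy_cnt_c(int16_t* dst, const int16_t* src, intptr_t srcStride)
{
    uint32_t count = 0;
    for (int y = 0; y < size; y++)
    {
        for (int x = 0; x < size; x++)
        {
            int16_t v = src[y * srcStride + x];
            dst[y * size + x] = v;
            count += (v != 0);
        }
    }
    return count;
}

typedef uint32_t (*copy_cnt_t)(int16_t*, const int16_t*, intptr_t);

// Hypothetical block-size indices for the table.
enum { BLK_4, BLK_8, BLK_16, BLK_32, NUM_BLK };

// Hypothetical primitive table: one function pointer per block size.
struct Primitives
{
    copy_cnt_t copy_cnt[NUM_BLK];
};

// Stand-in for a hand-written SIMD routine. It simply forwards to the C
// version so that the sketch stays portable.
uint32_t copy_cnt_8_fast(int16_t* dst, const int16_t* src, intptr_t srcStride)
{
    return copy_cnt_c<8>(dst, src, srcStride);
}

// Fill the table with the C versions, then override only the 8x8 entry when
// the CPU feature is available. The other sizes keep the C fallback until
// their optimized versions are ready to be enabled.
void setupPrimitives(Primitives& p, bool hasAVX2)
{
    p.copy_cnt[BLK_4]  = copy_cnt_c<4>;
    p.copy_cnt[BLK_8]  = copy_cnt_c<8>;
    p.copy_cnt[BLK_16] = copy_cnt_c<16>;
    p.copy_cnt[BLK_32] = copy_cnt_c<32>;

    if (hasAVX2)
        p.copy_cnt[BLK_8] = copy_cnt_8_fast;
}
```

With a layout like this, enabling another size later is a one-line change in the setup function, which is why it is convenient to ship that line in the same patch as the routine it points to.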
On Wed, Sep 10, 2014 at 3:28 PM, prav...@multicorewareinc.com wrote:
# HG changeset patch
# User Praveen Tiwari
# Date 1410343073 -19800
# Node ID
You can push 16x16 and 32x32 as well. They perform well but still need a bit
more improvement; I will send an improvement patch soon.
Regards,
Praveen Tiwari
On Thu, Sep 11, 2014 at 11:29 AM, Deepthi Nandakumar
deep...@multicorewareinc.com wrote:
Would be better to combine this asm
Ok, thanks.
# HG changeset patch
# User Praveen Tiwari
# Date 1410343073 -19800
# Node ID 2cd4a13086740728559fde3a176953e9aa4c0782
# Parent 7bc4db02ccc728f6e2ddedd036c96e3d37b90f22
copy_cnt: enable avx2 version of asm code
diff -r 7bc4db02ccc7 -r 2cd4a1308674 source/common/x86/asm-primitives.cpp
---