Youwei Wang has uploaded a new patch set (#6).

Change subject: IMPALA-2809: improve ByteSwap with builtin function or SSE or 
AVX2
......................................................................

IMPALA-2809: improve ByteSwap with builtin function or SSE or AVX2

Using SSE4.2/AVX2 intrinsic to accelerate the function
"static inline void ByteSwap(void* dst, const void* src, int len)" of BitUtil 
class,
and a scalar byte-swap routine is added as fallback.
Also the runtime selector for CPUs of different capacity is included,
as well as performance test and data verification.
Brief performance data is listed here:
CPU: Intel(R) Core(TM) i5-4460  [email protected]
Runtime consumed for 1000000000 bytes:
Impala ByteSwap: 1x
SSE4.2 ByteSwap: 2.699x
AVX2 Byteswap: 2.958x
Runtime consumed for 16 bytes:
Impala ByteSwap: 1x
Improved scalar swap: 2.721x

Change-Id: I392ed5a8d5683f30f161282c228c1aedd7b648c1
---
M be/src/benchmarks/CMakeLists.txt
M be/src/util/bit-util.inline.h
M be/src/util/cpu-info.cc
M be/src/util/cpu-info.h
4 files changed, 119 insertions(+), 30 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/81/3081/6
-- 
To view, visit http://gerrit.cloudera.org:8080/3081
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I392ed5a8d5683f30f161282c228c1aedd7b648c1
Gerrit-PatchSet: 6
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Youwei Wang <[email protected]>
Gerrit-Reviewer: Alex Behm <[email protected]>
Gerrit-Reviewer: Jim Apple <[email protected]>
Gerrit-Reviewer: Marcel Kornacker <[email protected]>
Gerrit-Reviewer: Mostafa Mokhtar <[email protected]>
Gerrit-Reviewer: Tim Armstrong <[email protected]>
Gerrit-Reviewer: Youwei Wang <[email protected]>

Reply via email to