Adar Dembo has submitted this change and it was merged. ( http://gerrit.cloudera.org:8080/14354 )
Change subject: KUDU-1938 Make UTF-8 truncation faster pt 2 ...................................................................... KUDU-1938 Make UTF-8 truncation faster pt 2 Adds Intel Intrinsics (up to SSE4.2) to speed up the processing of UTF8 character counting in the case of ASCII-only chunks (fast path) by doubling the chunk size in a single pass from 64 to 128 bits. Before: [ RUN ] CharUtilTest.StressTestUtf8 [ OK ] CharUtilTest.StressTestUtf8 (7746 ms) [ RUN ] CharUtilTest.StressTestAscii [ OK ] CharUtilTest.StressTestAscii (1028 ms) After: [ RUN ] CharUtilTest.StressTestUtf8 [ OK ] CharUtilTest.StressTestUtf8 (9285 ms) [ RUN ] CharUtilTest.StressTestAscii [ OK ] CharUtilTest.StressTestAscii (708 ms) Change-Id: I9a491157dd5c8b4815030bbda921a0afc0bafd28 Reviewed-on: http://gerrit.cloudera.org:8080/14354 Reviewed-by: Adar Dembo <[email protected]> Tested-by: Kudu Jenkins --- M src/kudu/util/char_util-test.cc M src/kudu/util/char_util.cc 2 files changed, 36 insertions(+), 4 deletions(-) Approvals: Adar Dembo: Looks good to me, approved Kudu Jenkins: Verified -- To view, visit http://gerrit.cloudera.org:8080/14354 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: kudu Gerrit-Branch: master Gerrit-MessageType: merged Gerrit-Change-Id: I9a491157dd5c8b4815030bbda921a0afc0bafd28 Gerrit-Change-Number: 14354 Gerrit-PatchSet: 17 Gerrit-Owner: Attila Bukor <[email protected]> Gerrit-Reviewer: Adar Dembo <[email protected]> Gerrit-Reviewer: Attila Bukor <[email protected]> Gerrit-Reviewer: Grant Henke <[email protected]> Gerrit-Reviewer: Kudu Jenkins (120)
