Zihao Ye has uploaded this change for review. ( http://gerrit.cloudera.org:8080/20926
Change subject: [WIP] IMPALA-12718: Support UTF-8 for trim() ...................................................................... [WIP] IMPALA-12718: Support UTF-8 for trim() Currently, the trim function (including BTRIM, LTRIM, RTRIM) cannot correctly handle strings containing multi-byte UTF-8 characters. Multi-byte UTF-8 characters are interpreted as multiple single-byte characters, leading to unexpected results. This patch provides UTF-8 support for the trim function, enabling these functions to correctly handle multi-byte UTF-8 characters as a whole. Testing: - TODO Change-Id: I5cfaffd71009f16eae75910af835bd2a34410856 --- M be/src/exprs/string-functions-ir.cc M be/src/exprs/string-functions.h M be/src/util/bit-util.h M be/src/util/string-util.cc 4 files changed, 184 insertions(+), 37 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/26/20926/6 -- To view, visit http://gerrit.cloudera.org:8080/20926 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: newchange Gerrit-Change-Id: I5cfaffd71009f16eae75910af835bd2a34410856 Gerrit-Change-Number: 20926 Gerrit-PatchSet: 6 Gerrit-Owner: Zihao Ye <[email protected]>
