Zihao Ye has uploaded this change for review. ( http://gerrit.cloudera.org:8080/20926 )


Change subject: [WIP] IMPALA-12718: Support UTF-8 for trim()
......................................................................

[WIP] IMPALA-12718: Support UTF-8 for trim()

Currently, the trim functions (including BTRIM, LTRIM, and RTRIM) do not
correctly handle strings that contain multi-byte UTF-8 characters. Each
multi-byte character is interpreted as a sequence of single-byte
characters, so trimming can match individual bytes rather than whole
characters and produce unexpected results. This patch adds UTF-8 support
to the trim functions so that each multi-byte UTF-8 character is treated
as a single unit.
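
To illustrate the intended behavior, here is a minimal standalone sketch of
character-aware trimming. It is not the code in this patch: the names
Utf8CharLen, SplitUtf8, TrimSide and Utf8TrimSketch are hypothetical, and it
uses std::string_view as a stand-in for whatever string type the backend
uses. Malformed or truncated sequences are simply consumed as they come,
which is an assumption made here for brevity.

```cpp
#include <cstddef>
#include <string_view>
#include <unordered_set>
#include <vector>

// Hypothetical helper: byte length of the UTF-8 character whose lead byte is c.
// Invalid lead bytes are treated as one-byte characters (an assumption).
static size_t Utf8CharLen(unsigned char c) {
  if (c < 0x80) return 1;
  if ((c & 0xE0) == 0xC0) return 2;
  if ((c & 0xF0) == 0xE0) return 3;
  if ((c & 0xF8) == 0xF0) return 4;
  return 1;
}

// Hypothetical helper: splits s into views, one per UTF-8 character.
static std::vector<std::string_view> SplitUtf8(std::string_view s) {
  std::vector<std::string_view> out;
  size_t i = 0;
  while (i < s.size()) {
    size_t len = Utf8CharLen(static_cast<unsigned char>(s[i]));
    if (len > s.size() - i) len = s.size() - i;  // truncated trailing sequence
    out.push_back(s.substr(i, len));
    i += len;
  }
  return out;
}

enum class TrimSide { kLeft, kRight, kBoth };

// Removes every leading and/or trailing character of str that appears in
// chars, comparing whole UTF-8 characters instead of single bytes.
// The result is a view into str.
std::string_view Utf8TrimSketch(std::string_view str, std::string_view chars,
                                TrimSide side) {
  std::unordered_set<std::string_view> trim_set;
  for (std::string_view c : SplitUtf8(chars)) trim_set.insert(c);

  std::vector<std::string_view> pieces = SplitUtf8(str);
  size_t begin = 0;
  size_t end = pieces.size();
  if (side != TrimSide::kRight) {
    while (begin < end && trim_set.count(pieces[begin]) > 0) ++begin;
  }
  if (side != TrimSide::kLeft) {
    while (end > begin && trim_set.count(pieces[end - 1]) > 0) --end;
  }
  if (begin == end) return std::string_view();

  const char* first = pieces[begin].data();
  const char* last = pieces[end - 1].data() + pieces[end - 1].size();
  return std::string_view(first, static_cast<size_t>(last - first));
}
```

With this sketch, trimming "é" (bytes C3 A9) from a string that starts with
"è" (bytes C3 A8) leaves the string unchanged, whereas a byte-wise trim
would strip the shared lead byte C3 and corrupt the first character.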

Testing:
 - TODO

Change-Id: I5cfaffd71009f16eae75910af835bd2a34410856
---
M be/src/exprs/string-functions-ir.cc
M be/src/exprs/string-functions.h
M be/src/util/bit-util.h
M be/src/util/string-util.cc
4 files changed, 184 insertions(+), 37 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/26/20926/6
--
To view, visit http://gerrit.cloudera.org:8080/20926
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I5cfaffd71009f16eae75910af835bd2a34410856
Gerrit-Change-Number: 20926
Gerrit-PatchSet: 6
Gerrit-Owner: Zihao Ye <[email protected]>
