Grant Henke has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14197 )

Change subject: IMPALA-5092 Add support for VARCHAR in Kudu tables
......................................................................


Patch Set 14:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/14197/14//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/14197/14//COMMIT_MSG@27
PS14, Line 27: IMPALA-5675 tracks adding UTF-8 Character length support to 
VARCHAR
             : columns and marked the truncation code with a TODO that 
references
             : that Jira.
> UTF-8 has the nice property that sorting by (unsigned) byte values is equiv
Done


http://gerrit.cloudera.org:8080/#/c/14197/14//COMMIT_MSG@33
PS14, Line 33: * Manually reproduced a check failure due to multi-byte 
characters
             :   and tested that length truncation resolve that issue.
> This would be a good sanity check, if only to prevent future breakage. But 
I started writing a test in test_kudu.py, but it looks like I would need to 
update the testing framework to handle UTF-8 as well:

   common/test_result_verifier.py:493: in parse_result_rows
      col = cols[i].encode('unicode_escape')
   E   UnicodeDecodeError: 'ascii' codec can't decode byte 0xe6 in position 0: 
ordinal not in range(128)

I will work on this automated test in a follow on patch. I opened IMPALA-9583 
to track this.


http://gerrit.cloudera.org:8080/#/c/14197/14//COMMIT_MSG@47
PS14, Line 47: support
> I don't think there are major changes needed, but definitely some changes,
I opened jiras to track this follow on work and will update the commit message 
to reference them.



--
To view, visit http://gerrit.cloudera.org:8080/14197
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d4959410fdd882bfa980cb55e8a7837c7823da8
Gerrit-Change-Number: 14197
Gerrit-PatchSet: 14
Gerrit-Owner: Attila Bukor <[email protected]>
Gerrit-Reviewer: Attila Bukor <[email protected]>
Gerrit-Reviewer: Csaba Ringhofer <[email protected]>
Gerrit-Reviewer: Grant Henke <[email protected]>
Gerrit-Reviewer: Impala Public Jenkins <[email protected]>
Gerrit-Reviewer: Tamas Mate <[email protected]>
Gerrit-Reviewer: Thomas Tauber-Marshall <[email protected]>
Gerrit-Reviewer: Tim Armstrong <[email protected]>
Gerrit-Comment-Date: Tue, 31 Mar 2020 16:56:58 +0000
Gerrit-HasComments: Yes

Reply via email to