Dan Hecht has posted comments on this change.

Change subject: IMPALA-2700: ASCII NUL characters are doubled on insert into text tables
......................................................................


Patch Set 5: Code-Review+2

(1 comment)

http://gerrit.cloudera.org:8080/#/c/3703/5//COMMIT_MSG
Commit Message:

PS5, Line 9: Currently the scanner processes the '\0' character as a no special character
           : whereas the writer treats it as a special character. The writer
           : appends a special character before writting which is causing the ASCII
           : NULL characters to double since they are the default escape characters.
           : This adds a check to treat '\0' as a no special character in the
           : writter.
This is confusing.  Is the following accurate?

When the escape char is set to '\0', the scanner treats this as no escape.
However, the text writer currently treats '\0' as a valid escape character
and escapes it when writing, causing a doubling of ASCII NUL characters.

The fix is to make the text writer treat escape char of '\0' as no escape.
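
To make sure we mean the same thing, here is a minimal sketch of the behavior described above. It is not the Impala writer code: escape_field, NO_ESCAPE and the treat_nul_as_no_escape flag are hypothetical names used only for illustration, and the flag exists only so the old and new behavior can be compared side by side.

```python
# Assumption: an escape char of '\0' means "no escape character configured".
NO_ESCAPE = "\0"


def escape_field(value: str, field_delim: str, escape_char: str,
                 treat_nul_as_no_escape: bool = True) -> str:
    """Hypothetical text-writer helper: escape one field before writing it.

    With treat_nul_as_no_escape=True (the proposed fix), an escape char of
    '\0' disables escaping, matching what the scanner does on read.
    With False (the current behavior), '\0' is used as a real escape
    character, so every NUL in the data gets another NUL prepended.
    """
    if treat_nul_as_no_escape and escape_char == NO_ESCAPE:
        return value  # nothing to escape; write the bytes unchanged

    out = []
    for ch in value:
        # Prefix the delimiter and the escape char itself with the escape char.
        if ch == field_delim or ch == escape_char:
            out.append(escape_char)
        out.append(ch)
    return "".join(out)


# Current behavior: the NUL in the data is doubled on write.
before_fix = escape_field("a\0b", ",", "\0", treat_nul_as_no_escape=False)

# Proposed behavior: '\0' means no escape, so the value is written as-is.
after_fix = escape_field("a\0b", ",", "\0")
```

A table with a real escape character (for example a backslash) goes through the same loop in both cases, so only the '\0' case should change.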


-- 
To view, visit http://gerrit.cloudera.org:8080/3703
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: comment
Gerrit-Change-Id: Ia30fa314d1ee1e99f9e7598466eb1570ca7940fc
Gerrit-PatchSet: 5
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: anujphadke <[email protected]>
Gerrit-Reviewer: Dan Hecht <[email protected]>
Gerrit-Reviewer: Tim Armstrong <[email protected]>
Gerrit-Reviewer: anujphadke <[email protected]>
Gerrit-HasComments: Yes
