PigStorage does not properly handle UTF8 data
---------------------------------------------
Key: PIG-63
URL: https://issues.apache.org/jira/browse/PIG-63
Project: Pig
Issue Type: Bug
Reporter: Olga Natkovich
>From Ben:
I just checked the code and the problem seems to be PigStorage. getNext() uses
readLine() which does not handle UTF8 correctly. putNext() also uses default
encoder rather than UTF8 explicitly.
Internally and in BinStorage UTF8 appears to be handled correctly.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.