PigStorage does not properly handle UTF8 data
---------------------------------------------

                 Key: PIG-63
                 URL: https://issues.apache.org/jira/browse/PIG-63
             Project: Pig
          Issue Type: Bug
            Reporter: Olga Natkovich


From Ben:

I just checked the code, and the problem seems to be in PigStorage. getNext()
uses readLine(), which does not handle UTF8 correctly. putNext() also uses the
default encoder rather than specifying UTF8 explicitly.

Internally and in BinStorage UTF8 appears to be handled correctly.
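
To illustrate the kind of change this points at, here is a minimal sketch (not
the actual PigStorage code) of reading and writing lines with an explicit
UTF-8 charset instead of relying on a byte-oriented readLine() or the platform
default encoder. The class name Utf8LineIo and both helper methods are
hypothetical and exist only for this example; how the real getNext() and
putNext() obtain their streams is not shown in this report.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
class Utf8LineIo {

    // Read the first line from raw bytes, decoding them as UTF-8 explicitly
    // rather than treating each byte as one character.
    static String readFirstLine(byte[] data) throws IOException {
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(new ByteArrayInputStream(data),
                                      StandardCharsets.UTF_8))) {
            return reader.readLine();
        }
    }

    // Encode one line as UTF-8 explicitly, so the result does not depend on
    // the JVM's default charset.
    static byte[] writeLine(String line) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(out, StandardCharsets.UTF_8)) {
            writer.write(line);
            writer.write('\n');
        }
        return out.toByteArray();
    }
}
```

With both sides pinned to UTF-8, a line containing non-ASCII characters should
survive a write followed by a read regardless of the platform's default
encoding.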


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.