[
https://issues.apache.org/jira/browse/PIG-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13625886#comment-13625886
]
Russell Jurney commented on PIG-842:
------------------------------------
Jeff, I just ran into this problem with:
A = LOAD 'data/mxm_reverse_mapping.txt' using PigStorage('<SEP>') as
(word:chararray, stem:chararray);
And it is inconvenient. Any progress update? I'd like to get this into Pig
0.12. Please let me know if I can assist.
> PigStorage should support multi-byte delimiters
> -----------------------------------------------
>
> Key: PIG-842
> URL: https://issues.apache.org/jira/browse/PIG-842
> Project: Pig
> Issue Type: Improvement
> Components: impl
> Affects Versions: 0.3.0
> Reporter: Santhosh Srinivasan
> Assignee: Jeff Markham
> Attachments: PigMultiByteJsonMetadata.java, PigMultiByteStorage.java,
> PigMultiByteTextOutputFormat.java
>
>
> Currently, PigStorage supports single byte delimiters. Users have requested
> mult-byte delimiters. There are performance implications with multi-byte
> delimiters. i.e., instead of looking for a single byte, PigStorage should
> look for a pattern ala BinStorage.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira