[
https://issues.apache.org/jira/browse/PIG-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547679#comment-13547679
]
Daniel Dai commented on PIG-842:
--------------------------------
I prefer a single PigStorage implementation for clarity and code maintenance
reason. For performance, we can do some optimization for single character case
which could bring the performance of single character delimit near equal.
> PigStorage should support multi-byte delimiters
> -----------------------------------------------
>
> Key: PIG-842
> URL: https://issues.apache.org/jira/browse/PIG-842
> Project: Pig
> Issue Type: Improvement
> Components: impl
> Affects Versions: 0.3.0
> Reporter: Santhosh Srinivasan
> Assignee: Jeff Markham
> Attachments: PigMultiByteJsonMetadata.java, PigMultiByteStorage.java,
> PigMultiByteTextOutputFormat.java
>
>
> Currently, PigStorage supports single byte delimiters. Users have requested
> mult-byte delimiters. There are performance implications with multi-byte
> delimiters. i.e., instead of looking for a single byte, PigStorage should
> look for a pattern ala BinStorage.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira