[
https://issues.apache.org/jira/browse/PIG-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13264628#comment-13264628
]
Dmitriy V. Ryaboy commented on PIG-2494:
----------------------------------------
Note that a far more powerful version of a Sequence File Loader is available in
Elephant-Bird: https://github.com/kevinweil/elephant-bird
This is a pretty small patch, though. Good one to practice patch submission on,
if someone wanted to post it using the procedure Joey linked to above.
> Improvement to SequenceFileLoader (NullWritable and Delimiter)
> --------------------------------------------------------------
>
> Key: PIG-2494
> URL: https://issues.apache.org/jira/browse/PIG-2494
> Project: Pig
> Issue Type: Improvement
> Components: piggybank
> Affects Versions: 0.9.1
> Environment: All
> Reporter: Ted Malaska
> Priority: Minor
> Labels: newbie, simple
> Attachments: SequenceFileLoader.java
>
>
> I wanted to add two features to SequenceFileLoader.
> 1. I added a delimiter so it will act more like PigStorage, in that it
> will Split the value if it is a type Text (chararray).
> 2. I added the option of the key being a NullWritable. I wanted to be
> able to process my Hive files in both Hive and Pig, but because my Hive
> sequence files have a NullWritable key I could not make this work with the
> current implementation of SequenceFileLoader.
> My change is attached to this Issue.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira