[
https://issues.apache.org/jira/browse/HADOOP-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12561233#action_12561233
]
eric baldeschwieler commented on HADOOP-2501:
---------------------------------------------
It would be great if we had a way of catting out the key/values from a sequence
file in the same format consumed by streaming.
It would be good if such a tool could take start and end offsets, so it could
be used with splits.
> Implement utility-tools for working with SequenceFiles
> ------------------------------------------------------
>
> Key: HADOOP-2501
> URL: https://issues.apache.org/jira/browse/HADOOP-2501
> Project: Hadoop
> Issue Type: New Feature
> Components: io
> Reporter: Arun C Murthy
> Assignee: Enis Soztutar
>
> It would be nice to implement a bunch of utilities to work with SequenceFiles:
> * info (print-out header information such as key/value types, compression
> type/codec etc.)
> * cat
> * head/tail
> * merge multiple seq-files into one
> * ...
> I'd imagine this would look like:
> {noformat}
> $ bin/hadoop seq -info /user/joe/blah.seq
> $ bin/hadoop seq -head -n 10 /user/joe/blah.seq
> {noformat}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.