[ http://issues.apache.org/jira/browse/HADOOP-175?page=comments#action_12377027 ]
Doug Cutting commented on HADOOP-175:
-------------------------------------
Also, we could add a generic dumper that sniffs the magic number of a file and
dumps it accordingly. If it's a file that begins with {'S', 'E', 'Q', 3}, then
it's a sequence file; if it's a directory with sequence files named "index" and
"data", then it's a map file; if none of the first 100 bytes are less than 32,
then it's text; etc.
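
A minimal sketch of the sniffing logic described above, to make the three checks
concrete. The class name FileSniffer, the Kind enum and the use of java.nio on
the local filesystem are assumptions made for illustration; none of them come
from Hadoop or from the attached patch, and a real dumper would presumably go
through Hadoop's own filesystem classes. The sketch also lets tab, newline and
carriage return count as text, which goes beyond the "no byte less than 32"
rule stated above, and it treats an empty file as unknown.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

/** Hypothetical helper for illustration; not part of Hadoop or the patch. */
final class FileSniffer {

    enum Kind { SEQUENCE_FILE, MAP_FILE, TEXT, UNKNOWN }

    // Header named in the comment: 'S', 'E', 'Q', then the byte value 3.
    private static final byte[] SEQ_MAGIC = {'S', 'E', 'Q', 3};
    private static final int TEXT_PROBE_BYTES = 100;

    static Kind sniff(Path path) throws IOException {
        if (Files.isDirectory(path)) {
            // A map file is a directory holding sequence files "index" and "data".
            boolean isMap = isSequenceFile(path.resolve("index"))
                    && isSequenceFile(path.resolve("data"));
            return isMap ? Kind.MAP_FILE : Kind.UNKNOWN;
        }
        if (!Files.isRegularFile(path)) {
            return Kind.UNKNOWN;
        }
        byte[] head = readHead(path, TEXT_PROBE_BYTES);
        if (startsWithMagic(head)) {
            return Kind.SEQUENCE_FILE;
        }
        return looksLikeText(head) ? Kind.TEXT : Kind.UNKNOWN;
    }

    private static boolean isSequenceFile(Path file) throws IOException {
        return Files.isRegularFile(file)
                && startsWithMagic(readHead(file, SEQ_MAGIC.length));
    }

    private static boolean startsWithMagic(byte[] head) {
        if (head.length < SEQ_MAGIC.length) {
            return false;
        }
        for (int i = 0; i < SEQ_MAGIC.length; i++) {
            if (head[i] != SEQ_MAGIC[i]) {
                return false;
            }
        }
        return true;
    }

    // Assumption: tab, LF and CR are tolerated; any other byte below 32 means "not text".
    private static boolean looksLikeText(byte[] head) {
        if (head.length == 0) {
            return false;
        }
        for (byte b : head) {
            int c = b & 0xFF;
            boolean whitespace = c == '\t' || c == '\n' || c == '\r';
            if (c < 32 && !whitespace) {
                return false;
            }
        }
        return true;
    }

    // Reads at most 'limit' bytes from the start of the file.
    private static byte[] readHead(Path file, int limit) throws IOException {
        byte[] buf = new byte[limit];
        int filled = 0;
        try (InputStream in = Files.newInputStream(file)) {
            while (filled < limit) {
                int n = in.read(buf, filled, limit - filled);
                if (n < 0) {
                    break;
                }
                filled += n;
            }
        }
        return Arrays.copyOf(buf, filled);
    }
}
```

A dumper could then switch on the result of FileSniffer.sniff(path) and hand the
path to whichever reader matches, falling back to a hex dump for UNKNOWN.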
> Utilities for reading SequenceFile and MapFile
> ----------------------------------------------
>
> Key: HADOOP-175
> URL: http://issues.apache.org/jira/browse/HADOOP-175
> Project: Hadoop
> Type: Improvement
> Components: io
> Reporter: Andrzej Bialecki
> Priority: Minor
> Attachments: patch.txt
>
> Most data in Hadoop is stored in SequenceFile-s and MapFile-s. Sometimes
> there is a need to examine such files, but no specialized utilities exist to
> read them.
> These two classes provide functionality to examine individual records in
> such files, and also to dump the content of such files to a plain text output.