[ 
http://issues.apache.org/jira/browse/HADOOP-175?page=comments#action_12377027 ] 

Doug Cutting commented on HADOOP-175:
-------------------------------------

Also, we could add a generic dumper that sniffs the magic number of a file and 
dumps it accordingly.  If it's a file that begins with {'S', 'E', 'Q', 3} then 
it's a sequence file; if it's a directory with sequence files named "index" and 
"data", then it's a map file; if none of the first 100 bytes are less than 32, 
then it's text; etc.
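
A rough sketch of that sniffing logic, for illustration only.  It is written 
in Python rather than against the Hadoop API, and the function name sniff() 
and the returned labels are made up for this example.  Two assumptions go 
beyond the rules above: the version byte after "SEQ" is not checked, and tab, 
LF and CR are allowed in text, since otherwise any multi-line file would fail 
the "no byte less than 32" test.

```python
import os

SEQ_MAGIC = b"SEQ"            # followed by a one-byte version number
TEXT_WHITESPACE = b"\t\n\r"   # assumption: tolerated in text despite being < 32


def sniff(path):
    """Guess what kind of file `path` is: 'sequence', 'map', 'text' or 'unknown'."""
    if os.path.isdir(path):
        # A map file is a directory holding sequence files named "index" and "data".
        parts = [os.path.join(path, name) for name in ("index", "data")]
        if all(os.path.isfile(p) and sniff(p) == "sequence" for p in parts):
            return "map"
        return "unknown"

    with open(path, "rb") as f:
        head = f.read(100)  # only the first 100 bytes are examined

    # Sequence file: magic bytes 'S', 'E', 'Q' plus a version byte (not validated here).
    if len(head) >= 4 and head.startswith(SEQ_MAGIC):
        return "sequence"

    # Text: no control bytes in the sampled prefix, apart from common whitespace.
    if head and all(b >= 32 or b in TEXT_WHITESPACE for b in head):
        return "text"

    return "unknown"
```

A dumper could then dispatch on the returned label and fall back to a hex 
dump for "unknown".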

> Utilities for reading SequenceFile and MapFile
> ----------------------------------------------
>
>          Key: HADOOP-175
>          URL: http://issues.apache.org/jira/browse/HADOOP-175
>      Project: Hadoop
>         Type: Improvement
>   Components: io
>     Reporter: Andrzej Bialecki 
>     Priority: Minor
>  Attachments: patch.txt
>
> Most data in Hadoop is stored in SequenceFile-s and MapFile-s. Sometimes 
> there is a need to examine such files, but no specialized utilities exist to 
> read them.
> These two classes provide functionality to examine individual records in 
> such files, and also to dump the content of such files to plain text output.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
