[ 
https://issues.apache.org/jira/browse/MAPREDUCE-157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12755585#action_12755585
 ] 

eric baldeschwieler commented on MAPREDUCE-157:
-----------------------------------------------

Doug, so your patch will store binary AVRO?  If we can convert, we should 
probably convert all the way to native AVRO.  That will be more tested / common 
than text AVRO and presumably require less storage.  Presumably AVRO will have 
a full toolset for dumping / exploring files like this, so the binary format 
should not be a problem?

> Job History log file format is not friendly for external tools.
> ---------------------------------------------------------------
>
>                 Key: MAPREDUCE-157
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-157
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>    Affects Versions: 0.20.1
>            Reporter: Owen O'Malley
>            Assignee: Jothi Padmanabhan
>             Fix For: 0.21.0
>
>         Attachments: mapred-157-10Sep.patch, mapred-157-15Sep-v1.patch, 
> mapred-157-15Sep.patch, mapred-157-4Sep.patch, mapred-157-7Sep-v1.patch, 
> mapred-157-7Sep.patch, mapred-157-prelim.patch, MAPREDUCE-157-avro.patch, 
> MAPREDUCE-157-avro.patch
>
>
> Currently, parsing the job history logs with external tools is very difficult 
> because of the format. The most critical problem is that newlines aren't 
> escaped in the strings. That makes using tools like grep, sed, and awk very 
> tricky.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to