[ https://issues.apache.org/jira/browse/MAPREDUCE-157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Cutting updated MAPREDUCE-157:
-----------------------------------

    Attachment: MAPREDUCE-157-avro.patch

> Honestly, the dozen or so *Event.java classes are begging to have a framework.

I am working on converting this patch to use Avro.  I have a schema that 
generates code for all of the event classes, and I am in the process of 
converting each event class into a wrapper around its generated class, 
providing constructors and accessor methods for the generated class.  
Serialization will switch to Avro binary.  If Avro 1.1 is released this week 
as expected, this could then be trivially changed to generate JSON instead.
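
To illustrate the wrapper approach described above, here is a minimal sketch. 
It is not taken from the attached patch: JobSubmitted is a hand-written 
stand-in for a schema-generated record (real Avro-generated classes look 
different), and the class name, field names, and getDatum() accessor are all 
hypothetical. The point is only that the event class holds the generated 
record and exposes constructors and accessors on top of it.

```java
// Stand-in for a schema-generated record: plain fields, no logic.
// Hypothetical; an actual Avro-generated class would differ.
class JobSubmitted {
    String jobId;
    String jobName;
    long submitTime;
}

// Hand-written event class that wraps the generated record.
class JobSubmittedEvent {
    private final JobSubmitted datum = new JobSubmitted();

    // The constructor fills in the generated record's fields.
    JobSubmittedEvent(String jobId, String jobName, long submitTime) {
        datum.jobId = jobId;
        datum.jobName = jobName;
        datum.submitTime = submitTime;
    }

    // Accessors delegate to the generated record.
    String getJobId() { return datum.jobId; }
    String getJobName() { return datum.jobName; }
    long getSubmitTime() { return datum.submitTime; }

    // Exposes the underlying record so a serializer can write it out.
    JobSubmitted getDatum() { return datum; }
}
```

With this shape, callers keep using the event class, while the serializer 
only ever sees the generated record, so the wire format can change without 
touching the callers.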

> Job History log file format is not friendly for external tools.
> ---------------------------------------------------------------
>
>                 Key: MAPREDUCE-157
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-157
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>    Affects Versions: 0.20.1
>            Reporter: Owen O'Malley
>            Assignee: Jothi Padmanabhan
>             Fix For: 0.21.0
>
>         Attachments: mapred-157-10Sep.patch, mapred-157-4Sep.patch, 
> mapred-157-7Sep-v1.patch, mapred-157-7Sep.patch, mapred-157-prelim.patch, 
> MAPREDUCE-157-avro.patch, MAPREDUCE-157-avro.patch
>
>
> Currently, parsing the job history logs with external tools is very difficult 
> because of the format. The most critical problem is that newlines aren't 
> escaped in the strings. That makes using tools like grep, sed, and awk very 
> tricky.
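
For illustration only, the newline problem described above can be sketched as 
follows. This is one possible escaping scheme, assumed for the example, and 
not the format used by the existing history files or by any attached patch; 
the HistoryEscaper class and its methods are hypothetical. Escaping 
backslash, newline, and carriage return keeps each record on a single 
physical line, which is what line-oriented tools such as grep, sed, and awk 
expect.

```java
// Hypothetical helper: keeps one log record per physical line.
class HistoryEscaper {
    // Replace backslash, newline and carriage return with two-character escapes.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                default: sb.append(c);
            }
        }
        return sb.toString();
    }

    // Inverse of escape(); unknown escape sequences are passed through as-is.
    static String unescape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c == '\\' && i + 1 < s.length()) {
                char next = s.charAt(++i);
                switch (next) {
                    case 'n': sb.append('\n'); break;
                    case 'r': sb.append('\r'); break;
                    case '\\': sb.append('\\'); break;
                    default: sb.append('\\').append(next);
                }
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }
}
```

A value such as a job name containing an embedded newline would then be 
written as one line, and a reader can recover the original string with 
unescape().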

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
