[ 
https://issues.apache.org/jira/browse/PIG-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12906671#action_12906671
 ] 

Jeff Zhang commented on PIG-794:
--------------------------------

Dmitriy,

In my patch I turn InternalMap as an avro array whose element is a record 
having two datums(one is key and the other is value).
But it occurred weird exception , not know what's wrong with my code 


{code}
Exception in thread "main" java.lang.NullPointerException
        at org.apache.avro.io.parsing.Parser.advance(Parser.java:86)
        at 
org.apache.avro.io.ResolvingDecoder.readFieldOrder(ResolvingDecoder.java:121)
        at 
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:77)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
        at 
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:66)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
        at 
org.apache.avro.generic.GenericDatumReader.readArray(GenericDatumReader.java:184)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:108)
        at 
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:81)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
        at 
org.apache.avro.generic.GenericDatumReader.readArray(GenericDatumReader.java:184)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:108)
        at 
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:83)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
        at 
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:97)
        at org.apache.avro.file.DataFileStream.next(DataFileStream.java:198)
        at org.apache.avro.file.DataFileStream.next(DataFileStream.java:185)
        at org.apache.pig.impl.io.avro.PigData.main(PigData.java:224)

{code}

> Use Avro serialization in Pig
> -----------------------------
>
>                 Key: PIG-794
>                 URL: https://issues.apache.org/jira/browse/PIG-794
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>    Affects Versions: 0.2.0
>            Reporter: Rakesh Setty
>            Assignee: Dmitriy V. Ryaboy
>         Attachments: avro-0.1-dev-java_r765402.jar, AvroStorage.patch, 
> AvroStorage_2.patch, AvroStorage_3.patch, AvroStorage_4.patch, AvroTest.java, 
> jackson-asl-0.9.4.jar, PIG-794.patch
>
>
> We would like to use Avro serialization in Pig to pass data between MR jobs 
> instead of the current BinStorage. Attached is an implementation of 
> AvroBinStorage which performs significantly better compared to BinStorage on 
> our benchmarks.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to