[
https://issues.apache.org/jira/browse/PIG-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12906671#action_12906671
]
Jeff Zhang commented on PIG-794:
--------------------------------
Dmitriy,
In my patch I turn InternalMap as an avro array whose element is a record
having two datums(one is key and the other is value).
But it occurred weird exception , not know what's wrong with my code
{code}
Exception in thread "main" java.lang.NullPointerException
at org.apache.avro.io.parsing.Parser.advance(Parser.java:86)
at
org.apache.avro.io.ResolvingDecoder.readFieldOrder(ResolvingDecoder.java:121)
at
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:77)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
at
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:66)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
at
org.apache.avro.generic.GenericDatumReader.readArray(GenericDatumReader.java:184)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:108)
at
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:81)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
at
org.apache.avro.generic.GenericDatumReader.readArray(GenericDatumReader.java:184)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:108)
at
org.apache.pig.impl.io.avro.PigDataRecordReader.readRecord(PigDataRecordReader.java:83)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:106)
at
org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:97)
at org.apache.avro.file.DataFileStream.next(DataFileStream.java:198)
at org.apache.avro.file.DataFileStream.next(DataFileStream.java:185)
at org.apache.pig.impl.io.avro.PigData.main(PigData.java:224)
{code}
> Use Avro serialization in Pig
> -----------------------------
>
> Key: PIG-794
> URL: https://issues.apache.org/jira/browse/PIG-794
> Project: Pig
> Issue Type: Improvement
> Components: impl
> Affects Versions: 0.2.0
> Reporter: Rakesh Setty
> Assignee: Dmitriy V. Ryaboy
> Attachments: avro-0.1-dev-java_r765402.jar, AvroStorage.patch,
> AvroStorage_2.patch, AvroStorage_3.patch, AvroStorage_4.patch, AvroTest.java,
> jackson-asl-0.9.4.jar, PIG-794.patch
>
>
> We would like to use Avro serialization in Pig to pass data between MR jobs
> instead of the current BinStorage. Attached is an implementation of
> AvroBinStorage which performs significantly better compared to BinStorage on
> our benchmarks.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.