There are two other JIRAs with alternate Avro<-->Pig implementations with 
different feature sets.

https://issues.apache.org/jira/browse/PIG-794 aims to use Avro internally 
within Pig for efficiency, including intermediate serializatoin.

https://issues.apache.org/jira/browse/AVRO-592 has the same goals that your 
patch does, but has fewer restrictions on what can and can't be written/read.  
It supports writing any Pig schema and reading it back in, but only reading a 
subset of Avro schemas (non-recursive; I may add unions later).  With a little 
more work it could support intermediate serialization for pig as well.   Longer 
term goals include being able to use AvroStorage along with a Hive AvroSerDe on 
the same data, supporting projection, and supporting partitioning.

I've been hoping to finish up AVRO-592 but am currently busy with other things.

-Scott

On Nov 30, 2010, at 9:05 PM, Lin Guo wrote:

> Hi,
> 
> We'd like to patch our pig AvroStorage function and
> would highly appreciate any kinds of comments.
> 
> doc:
> http://snaprojects.jira.com/wiki/display/HTOOLS/AvroStorage+-+Pig+support+for+Avro+data
> 
> jira:
> https://issues.apache.org/jira/browse/PIG-1748
> 
> Many thanks,
> Lin

Reply via email to