Russell Jurney created PIG-3111:
-----------------------------------

             Summary: ToAvro to convert any Pig record to an Avro bytearray
                 Key: PIG-3111
                 URL: https://issues.apache.org/jira/browse/PIG-3111
             Project: Pig
          Issue Type: New Feature
          Components: data, internal-udfs
    Affects Versions: 0.12
            Reporter: Russell Jurney
            Assignee: Russell Jurney
             Fix For: 0.12


I want to create a ToAvro() builtin that converts arbitrary Pig fields, 
including complex types (bags, tuples, maps), to Avro format as bytearrays.
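
To make the target format concrete, here is a minimal sketch of Avro's binary
encoding for a nested record, written in plain Python with no dependencies. It
is not the proposed UDF: a real ToAvro() would presumably be written in Java
against the Avro library. The helper names (encode_long, encode_string, encode)
are hypothetical, only a handful of Avro types are covered, and the mapping of
a Pig tuple to an Avro record and a Pig bag to an Avro array is an assumption.

```python
def encode_long(n: int) -> bytes:
    """Avro long: zigzag encoding, then a little-endian base-128 varint."""
    z = (n << 1) ^ (n >> 63)  # zigzag; assumes n fits in a signed 64-bit long
    out = bytearray()
    while z > 0x7F:
        out.append((z & 0x7F) | 0x80)  # low 7 bits, continuation bit set
        z >>= 7
    out.append(z)
    return bytes(out)


def encode_string(s: str) -> bytes:
    """Avro string: byte length as a long, followed by the UTF-8 bytes."""
    data = s.encode("utf-8")
    return encode_long(len(data)) + data


def encode(schema, value) -> bytes:
    """Encode value against a (tiny subset of an) Avro schema."""
    if schema == "long":
        return encode_long(value)
    if schema == "string":
        return encode_string(value)
    kind = schema["type"]
    if kind == "array":  # assumed target for a Pig bag
        body = b"".join(encode(schema["items"], v) for v in value)
        block = encode_long(len(value)) + body if value else b""
        return block + encode_long(0)  # a zero count terminates the blocks
    if kind == "map":  # Avro map keys are always strings
        body = b"".join(
            encode_string(k) + encode(schema["values"], v) for k, v in value.items()
        )
        block = encode_long(len(value)) + body if value else b""
        return block + encode_long(0)
    if kind == "record":  # assumed target for a Pig tuple
        # A record is simply its fields' encodings, in schema order.
        return b"".join(encode(f["type"], value[f["name"]]) for f in schema["fields"])
    raise ValueError(f"unsupported schema: {schema!r}")


SCHEMA = {
    "type": "record",
    "name": "Example",
    "fields": [
        {"name": "id", "type": "long"},
        {"name": "name", "type": "string"},
        {"name": "tags", "type": {"type": "array", "items": "string"}},
        {"name": "props", "type": {"type": "map", "values": "long"}},
    ],
}

payload = encode(
    SCHEMA, {"id": 27, "name": "foo", "tags": ["a", "b"], "props": {"n": 3}}
)
```

The resulting payload is schema-less bytes, so whatever reads the value back
needs the same schema, which is one reason the schema handling would have to
be part of the UDF design.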

This would enable storing Avro records in arbitrary data stores, for example 
with HBaseAvroStorage in PIG-2889.

See PIG-2641 for the corresponding ToJson builtin.

This points to a greater need for customizable/pluggable serialization: 
serializers that plug into storefuncs and handle serialization independently 
of storage. For example, we might do these operations:

a = load 'my_data' as (some_schema);
b = foreach a generate ToJson(*);
c = foreach a generate ToAvro(*);
store b into 'hbase://JsonValueTable' using HBaseStorage(...);
store c into 'hbase://AvroValueTable' using HBaseStorage(...);

I'll make a ticket for pluggable serialization separately.
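
One way to picture the pluggable idea, sketched in plain Python rather than as
Pig's Java storefunc API: the store accepts any function that turns a record
into bytes, so the format can change without touching storage code. Every
name here (Serializer, SERIALIZERS, InMemoryStore) is hypothetical, and the
in-memory dict merely stands in for a table such as HBase.

```python
import json
from typing import Callable, Dict

# A serializer is any callable that turns a record into bytes.
Serializer = Callable[[dict], bytes]


def to_json_bytes(record: dict) -> bytes:
    """Serialize a record as UTF-8 JSON with keys in sorted order."""
    return json.dumps(record, sort_keys=True).encode("utf-8")


# Hypothetical registry; an Avro serializer would be registered the same way.
SERIALIZERS: Dict[str, Serializer] = {"json": to_json_bytes}


class InMemoryStore:
    """Stand-in for a storefunc: it stores bytes and knows nothing of formats."""

    def __init__(self, serializer: Serializer) -> None:
        self.serializer = serializer
        self.rows: Dict[str, bytes] = {}

    def put(self, key: str, record: dict) -> None:
        # Serialization is delegated entirely to the plugged-in callable.
        self.rows[key] = self.serializer(record)


store = InMemoryStore(SERIALIZERS["json"])
store.put("row1", {"name": "foo", "id": 27})
```

Swapping formats then means passing a different entry from the registry, with
no change to the store itself.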

