pig-user  

Re: Tuple and Datum implementations

Pete Wyckoff
Mon, 22 Sep 2008 17:00:08 -0700

So, my use case would be to use the Hive MetaStore and
Serializes/Deserializers to implement:

1. a new Pig storage class based on looking up the metadata from the
metastore
2. a new Tuple/datum/bag based on native Java list,bag, integer, ...
3. add a "describe" command to the grunt shell.

Here, I will assume that the "filename" passed in to bindTo in storage is
the name of the "table".

Is this a plausible implementation as I don't know much about the Pig
internals and would people find such an optional feature useful?

Thanks, pete



On 9/19/08 5:39 PM, "Olga Natkovich" <[EMAIL PROTECTED]> wrote:

> In types branch, we are doing a couple of things:
> 
> (1) Using native java types for scalar types and for DataMap;
> (2) Making Tuple and Bag interfaces so that new implementations can be
> written.
> 
> Olga
> 
>> -----Original Message-----
>> From: Pete Wyckoff [EMAIL PROTECTED]
>> Sent: Friday, September 19, 2008 5:35 PM
>> To: [EMAIL PROTECTED]
>> Subject: Tuple and Datum implementations
>> 
>> 
>> Is it possible to implement these using an object composed of
>> List, Commons bag, Integer, Double, String, Boolean and Byte?
>> (I will also have type information, so I don't need to endure
>> the cost of reflection.)
>> 
>> Also, is it possible to implement them using a record IO
>> Object and reflection?
>> 
>> Thanks, pete
>> 
>>