I'll post my solution in a few hours =)

On Tue, Apr 19, 2011 at 3:02 PM, John Hui <[email protected]> wrote:

> I don't think one parser will work for all solution.  It really depends on
> your data, since there might be a list within a list.
>
> But pick anyone as a starting point and customize it for your own json data
> format.
>
>
> On Tue, Apr 19, 2011 at 3:00 PM, Alan Gates <[email protected]> wrote:
>
>>
>> On Apr 19, 2011, at 11:44 AM, Daniel Eklund wrote:
>>
>>  <snip>
>>>
>>> A quick question about the UDF's registered at the top of a pig script:
>>>
>>> does
>>> REGISTER myJar.jar
>>> distribute the jar across HDFS (like a Hadoop job jar) so that the
>>> distribution of the code to the cluster nodes is transparent?
>>> In other words, do we NOT have to distribute myJar.jar to each node on
>>> the
>>> cluster.
>>>
>>
>> Pig takes care of getting myJar.jar to the task nodes; you do not have to
>> worry about it.
>>
>> Alan.
>>
>>
>

Reply via email to