[ 
https://issues.apache.org/jira/browse/PIG-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ankur updated PIG-958:
----------------------

    Attachment: 958.v3.patch

Pradeep,
                 Thanks for your review comments. I have incorporated the 
suggestions provided in the code review. The code is vastly simplified, cleaner 
and more readable :-). 

Unit test now pass in local mode but fail in cluster mode after taking an 
update of Pig code base. The error I see is :-
hdfs://localhost.localdomain:40352/user/gankur/output/_temporary/_attempt_20091009030519686_0001_m_000000_0/output,
 expected: file:///

Looks like a config issue with org.apache.pig.test.MiniCluster in the latest 
pig code. I didn't get time to debug this as I am going on a vacation. 
Regardless, I have attached the new patch for your review. Please suggest what 
needs to be done to pass the unit test in cluster mode.

-Ankur

> Splitting output data on key field
> ----------------------------------
>
>                 Key: PIG-958
>                 URL: https://issues.apache.org/jira/browse/PIG-958
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Ankur
>         Attachments: 958.v3.patch
>
>
> Pig users often face the need to split the output records into a bunch of 
> files and directories depending on the type of record. Pig's SPLIT operator 
> is useful when record types are few and known in advance. In cases where type 
> is not directly known but is derived dynamically from values of a key field 
> in the output tuple, a custom store function is a better solution.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to