[ 
https://issues.apache.org/jira/browse/PIG-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785407#action_12785407
 ] 

Olga Natkovich commented on PIG-1084:
-------------------------------------

Both Accumulator and Alebraic UDFs are described in the UDF manual. They are 
both intended to improve performance and as such both need to be mentioned in 
the cookbook and then refered to the manual. Something like:

If your UDF can't be made Algebraic but is able to deal with getting input in 
chunks rather than all at once, consider implementing Aggregator interface 
[link to UDF manual here] to reduce the amount of memory used by your script. 
Even if your function is Algebraic but can be used on conjunction with 
Accumulator functions, it needs to implement the Accumulator interface.

> Pig CookBook documentation "Take Advantage of Join Optimization" 
> additions:Merge and Skewed Join
> ------------------------------------------------------------------------------------------------
>
>                 Key: PIG-1084
>                 URL: https://issues.apache.org/jira/browse/PIG-1084
>             Project: Pig
>          Issue Type: Bug
>          Components: documentation
>    Affects Versions: 0.6.0
>            Reporter: Viraj Bhat
>            Assignee: Corinne Chandel
>             Fix For: 0.6.0
>
>         Attachments: cookbook.patch
>
>
> Hi all,
>  We have a host of Join optimizations that have been implemented recently in 
> Pig to improve performance. These include:
> http://hadoop.apache.org/pig/docs/r0.5.0/piglatin_reference.html#JOIN
> 1) Merge Join
> 2) Skewed Join
> It would be nice to mention the Merge Join and Skewed join in the following 
> section on the PigCookBook
> http://hadoop.apache.org/pig/docs/r0.5.0/cookbook.html#Take+Advantage+of+Join+Optimization
> Can we update this release 0.6??
> Thanks
> Viraj

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to