[ 
https://issues.apache.org/jira/browse/PIG-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15284851#comment-15284851
 ] 

Rohini Palaniswamy commented on PIG-4884:
-----------------------------------------

Yes, I copied that code from PigCombiner.Combine. It is the same behavior as 
MapReduce (see HADOOP-3226): combiners can be called multiple times in both map 
and reduce. A combiner is called whenever a merge is done during a spill in the map.
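
To illustrate why repeated combiner calls are harmless for a distinct operation, here is a minimal sketch in Python. It is not Pig's DistinctCombiner.Combine or PigCombiner.Combine code; the function name distinct_combine and the sample spill data are assumptions made up for this example. It only shows that a combiner which emits each key once from a sorted run gives the same result whether it runs once or again after a merge.

```python
import heapq
from itertools import groupby


def distinct_combine(sorted_keys):
    """Emit each key once from a key-sorted run (hypothetical helper)."""
    return [key for key, _group in groupby(sorted_keys)]


# Two map-side spills, each sorted and combined independently.
spill_a = distinct_combine(sorted(["x", "y", "x", "z"]))
spill_b = distinct_combine(sorted(["y", "y", "w"]))

# Merging the spills can reintroduce duplicates across runs,
# so the combiner is applied again on the merged stream.
merged = distinct_combine(heapq.merge(spill_a, spill_b))

# Applying the combiner one more time (e.g. on the reduce side) changes nothing.
again = distinct_combine(merged)

print(merged)  # ['w', 'x', 'y', 'z']
print(again == merged)  # True
```

Because the operation is idempotent, the framework is free to run it zero, one, or several times without affecting the final distinct output.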

> Tez needs to use DistinctCombiner.Combine
> -----------------------------------------
>
>                 Key: PIG-4884
>                 URL: https://issues.apache.org/jira/browse/PIG-4884
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>         Attachments: PIG-4884-1.patch
>
>
> While investigating a slow job, I realized that Tez is not using 
> DistinctCombiner.Combine, which is more efficient.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
