[
https://issues.apache.org/jira/browse/PIG-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15284851#comment-15284851
]
Rohini Palaniswamy commented on PIG-4884:
-----------------------------------------
Yes. Copied that code from PigCombiner.Combine. It is same behavior as
mapreduce - HADOOP-3226 Combiners can be called multiple times in both map and
reduce. Combiners will be called whenever merge is done during spill in the map.
> Tez needs to use DistinctCombiner.Combine
> -----------------------------------------
>
> Key: PIG-4884
> URL: https://issues.apache.org/jira/browse/PIG-4884
> Project: Pig
> Issue Type: Improvement
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
> Fix For: 0.16.0
>
> Attachments: PIG-4884-1.patch
>
>
> Was investigating a job slowness and realized that Tez is not using
> DistinctCombiner.Combine which is more efficient.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)