[
https://issues.apache.org/jira/browse/PIG-4657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14697869#comment-14697869
]
Daniel Dai commented on PIG-4657:
---------------------------------
Wow, that's much more than I expected. I was hoping the raw comparator would
give a speedup of 20%.
> [Pig on Tez] Optimize GroupBy and Distinct key comparison
> ---------------------------------------------------------
>
> Key: PIG-4657
> URL: https://issues.apache.org/jira/browse/PIG-4657
> Project: Pig
> Issue Type: Sub-task
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
> Fix For: 0.16.0
>
> Attachments: PIG-4657-1.patch
>
>
> While the bytes comparator cannot be used for joins until TEZ-2715 is
> available, it can be used for group by and distinct if they have only one
> Tez input. If there is more than one input due to union optimization
> (OrderedGroupedMergedKVInput), the full comparator still has to be used, as
> OrderedGroupedMergedKVInput uses the comparator to merge the two underlying
> inputs.
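
To illustrate the distinction the description draws, here is a minimal sketch
contrasting a "full" comparator, which deserializes both keys before comparing,
with a bytes comparator, which compares the serialized form directly. It is not
taken from the patch: the class ComparatorSketch, its methods, and the UTF-8
string serialization are all hypothetical, and the real Pig key types and
comparators are more involved.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration only; not the PIG-4657 implementation.
class ComparatorSketch {
    // Assumed serialization: the key is a string stored as UTF-8 bytes.
    static byte[] serialize(String key) {
        return key.getBytes(StandardCharsets.UTF_8);
    }

    // "Full" comparison: deserialize both keys, then compare the objects.
    static int fullCompare(byte[] a, byte[] b) {
        String left = new String(a, StandardCharsets.UTF_8);
        String right = new String(b, StandardCharsets.UTF_8);
        return left.compareTo(right);
    }

    // Bytes comparison: unsigned lexicographic order over the serialized
    // form, with no object allocation or deserialization.
    static int bytesCompare(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int x = a[i] & 0xff;
            int y = b[i] & 0xff;
            if (x != y) {
                return x - y;
            }
        }
        return a.length - b.length;
    }

    public static void main(String[] args) {
        byte[][] keys = {
            serialize("pig"), serialize("tez"), serialize("pig"), serialize("apple")
        };
        // Sorting by raw bytes places identical keys next to each other.
        Arrays.sort(keys, ComparatorSketch::bytesCompare);
        for (byte[] k : keys) {
            System.out.println(new String(k, StandardCharsets.UTF_8));
        }
    }
}
```

In this sketch the bytes comparator returns 0 exactly when the serialized keys
are identical, so sorting with it leaves equal keys adjacent. My assumption is
that this is the property group by and distinct rely on when there is a single
input; the issue itself only states that the full comparator is still required
when OrderedGroupedMergedKVInput merges more than one input.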