[
https://issues.apache.org/jira/browse/PIG-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15328798#comment-15328798
]
Xianda Ke commented on PIG-4810:
--------------------------------
Hi [~kellyzly], Thanks for your comments.
1. setReplication() make sense. Thanks.
2. MergeJoin require sorted data as input. MergeJoin optimization will fail UT.
That why ORDER query is added.
3. I will fix indent issue.
I will update the patch soon.
> Implement Merge join for spark engine
> -------------------------------------
>
> Key: PIG-4810
> URL: https://issues.apache.org/jira/browse/PIG-4810
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: Xianda Ke
> Fix For: spark-branch
>
> Attachments: PIG-4810-2.patch, PIG-4810-3.patch, PIG-4810-4.patch,
> PIG-4810-5.patch, PIG-4810.patch
>
>
> In current code base(a9151ac), we use regular join to implement merge join in
> spark mode.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)