[ https://issues.apache.org/jira/browse/PIG-3527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mark Wagner updated PIG-3527:
-----------------------------
Attachment: PIG-3527.1.patch
Here's an initial patch. There are some things that I need to clean up, and
I've marked these with TODOs. I've posted a review at
https://reviews.apache.org/r/15194/. One interesting thing to note is that
after attaching inputs directly to the operator pipeline, I observed a ~40%
speedup. I believe this is because there are fewer calls returning
STATUS_EOP, but I haven't tested this.
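
To make the STATUS_EOP hypothesis concrete, here is a toy model that only counts calls. It is not Pig code and not taken from the patch: EopCallCount, PushedOperator, PulledOperator, and their methods are names invented for this sketch, and the Status enum only mirrors the STATUS_OK / STATUS_EOP idea. It assumes that in the "pushed" style each tuple is attached to the operator and the pipeline is then drained until it reports end-of-processing, while in the "pulled" style the operator reads from its input directly.

```java
import java.util.Iterator;
import java.util.List;

public class EopCallCount {
    // Hypothetical stand-in for a pipeline status code.
    enum Status { STATUS_OK, STATUS_EOP }

    /** Toy operator that only produces output for a tuple attached by the caller. */
    static final class PushedOperator {
        private String attached;
        int calls = 0;

        void attachInput(String tuple) { attached = tuple; }

        Status getNext() {
            calls++;
            if (attached == null) {
                return Status.STATUS_EOP; // nothing attached: caller must attach again
            }
            attached = null;              // consume the attached tuple
            return Status.STATUS_OK;
        }
    }

    /** Toy operator that pulls tuples from its input itself. */
    static final class PulledOperator {
        private final Iterator<String> input;
        int calls = 0;

        PulledOperator(Iterator<String> input) { this.input = input; }

        Status getNext() {
            calls++;
            if (!input.hasNext()) {
                return Status.STATUS_EOP; // reported once, at real end of input
            }
            input.next();
            return Status.STATUS_OK;
        }
    }

    /** Attach each tuple, then drain the operator until it reports EOP. */
    static int callsWhenPushed(List<String> tuples) {
        PushedOperator op = new PushedOperator();
        for (String t : tuples) {
            op.attachInput(t);
            while (op.getNext() == Status.STATUS_OK) {
                // a real pipeline would emit the result here
            }
        }
        return op.calls;
    }

    /** Let the operator read its input directly until the input is exhausted. */
    static int callsWhenPulled(List<String> tuples) {
        PulledOperator op = new PulledOperator(tuples.iterator());
        while (op.getNext() == Status.STATUS_OK) {
            // a real pipeline would emit the result here
        }
        return op.calls;
    }

    public static void main(String[] args) {
        List<String> tuples = List.of("a", "b", "c", "d");
        System.out.println("pushed calls: " + callsWhenPushed(tuples)); // 8
        System.out.println("pulled calls: " + callsWhenPulled(tuples)); // 5
    }
}
```

In this model, N tuples cost 2N getNext calls when pushed (one STATUS_OK and one STATUS_EOP per tuple) and N + 1 calls when pulled. This only shows the call counts; it is not a benchmark and does not confirm that the call overhead accounts for the observed speedup.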
> Allow PigProcessor to handle multiple inputs
> --------------------------------------------
>
> Key: PIG-3527
> URL: https://issues.apache.org/jira/browse/PIG-3527
> Project: Pig
> Issue Type: Sub-task
> Components: tez
> Reporter: Mark Wagner
> Assignee: Mark Wagner
> Fix For: tez-branch
>
> Attachments: PIG-3527.1.patch
>
>
> The PigProcessor needs to be able to handle multiple distinct inputs. These
> can come in a variety of flavors including multiple "file" inputs (Merge
> join), multiple shuffle inputs (Hash Join / Co-group), and a mix (Replicated
> Join).