Rohini Palaniswamy created PIG-3856:
---------------------------------------
Summary: UnionOptimizer in Tez should optimize the case of
replicated join
Key: PIG-3856
URL: https://issues.apache.org/jira/browse/PIG-3856
Project: Pig
Issue Type: Sub-task
Reporter: Rohini Palaniswamy
The replicated join input that was broadcast to the union vertex now needs to
be broadcast to all of the union predecessors. So we need to:
- Create edges from the replicated join input to all of the union predecessors.
- Change the replicated join input to write to multiple outputs.
This can be further optimized by using a shared edge, which is yet to be
implemented in Tez (TEZ-391).
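
The rewiring described above can be illustrated with a toy graph model. This is only a sketch under stated assumptions: the Dag class, the rewire_replicated_inputs function, the "DATA" and "BROADCAST" edge labels, and the vertex names are all hypothetical and are not Pig or Tez APIs. It shows the replicated join input gaining one broadcast edge per union predecessor, so that it writes to multiple outputs.

```python
from collections import defaultdict


class Dag:
    """Toy DAG (hypothetical, not a Pig/Tez class): source -> {destination: edge type}."""

    def __init__(self):
        self.edges = defaultdict(dict)

    def connect(self, src, dst, edge_type):
        self.edges[src][dst] = edge_type

    def predecessors(self, vertex, edge_type=None):
        """Return sorted sources with an edge into `vertex`, optionally filtered by type."""
        return sorted(
            src
            for src, outs in self.edges.items()
            if vertex in outs and (edge_type is None or outs[vertex] == edge_type)
        )


def rewire_replicated_inputs(dag, union_vertex):
    """Move each broadcast edge that targets the union onto the union's predecessors."""
    # Assumption: a replicated join input is any vertex with a BROADCAST edge into the union.
    repl_inputs = dag.predecessors(union_vertex, "BROADCAST")
    union_preds = [v for v in dag.predecessors(union_vertex) if v not in repl_inputs]
    for repl in repl_inputs:
        # Drop the single edge into the union vertex ...
        del dag.edges[repl][union_vertex]
        # ... and write to multiple outputs instead, one per union predecessor.
        for pred in union_preds:
            dag.connect(repl, pred, "BROADCAST")
    return repl_inputs, union_preds


dag = Dag()
dag.connect("load_a", "union", "DATA")
dag.connect("load_b", "union", "DATA")
dag.connect("small_table", "union", "BROADCAST")
rewire_replicated_inputs(dag, "union")
print(dict(dag.edges["small_table"]))
```

In this model the replicated input ends up with one edge per union predecessor; a shared edge, as proposed in TEZ-391, would presumably let those outputs be written once rather than once per edge.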