Cheolsoo Park created PIG-3835:
----------------------------------
Summary: Optimize union followed by store
Key: PIG-3835
URL: https://issues.apache.org/jira/browse/PIG-3835
Project: Pig
Issue Type: Sub-task
Components: tez
Affects Versions: tez-branch
Reporter: Cheolsoo Park
Fix For: tez-branch
PIG-3742 implements union using VertexGroup. Currently, union is a blocking
operator, meaning that a new vertex is added for the operators that follow it.
But if the succeeding vertex contains only a single store, the MROutput could
be attached directly to the VertexGroup instead of adding a new vertex for it.
Each union source vertex would then write directly to the destination, which
should be faster because the extra vertex is no longer needed.
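
The proposed rewrite can be illustrated with a toy plan model. This is only a
sketch under stated assumptions: UnionGroup, SuccessorVertex, and optimize are
hypothetical names invented for illustration and are not the Pig or Tez API,
and operators are represented as plain strings such as "STORE:/out".

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class UnionStoreSketch {
    /** Hypothetical stand-in for the vertex group created for a union. */
    static final class UnionGroup {
        final List<String> sourceVertices;
        String attachedOutput; // stays null unless a store is attached directly

        UnionGroup(List<String> sourceVertices) {
            this.sourceVertices = new ArrayList<>(sourceVertices);
        }
    }

    /** Hypothetical stand-in for the vertex holding the operators after the union. */
    static final class SuccessorVertex {
        final List<String> operators; // e.g. "STORE:/out" or "FILTER"

        SuccessorVertex(List<String> operators) {
            this.operators = new ArrayList<>(operators);
        }
    }

    /**
     * If the successor vertex holds exactly one store, attach its output to the
     * group and drop the successor vertex. Returns the resulting vertex count.
     */
    static int optimize(UnionGroup group, SuccessorVertex successor) {
        // Unoptimized plan: one vertex per union source plus the successor vertex.
        int vertexCount = group.sourceVertices.size() + 1;
        boolean onlyStore = successor.operators.size() == 1
                && successor.operators.get(0).startsWith("STORE:");
        if (onlyStore) {
            // Each source vertex now writes straight to the store location.
            group.attachedOutput = successor.operators.get(0).substring("STORE:".length());
            vertexCount -= 1; // the successor vertex is no longer needed
        }
        return vertexCount;
    }

    public static void main(String[] args) {
        UnionGroup group = new UnionGroup(Arrays.asList("scanA", "scanB"));
        SuccessorVertex storeOnly = new SuccessorVertex(Arrays.asList("STORE:/out"));
        System.out.println(optimize(group, storeOnly) + " vertices, output=" + group.attachedOutput);
    }
}
```

In this sketch, a successor vertex that contains anything besides a single
store (for example a filter followed by a store) is left untouched, which
mirrors the "only one store" condition in the description above.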