[ https://issues.apache.org/jira/browse/PIG-3835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Cheolsoo Park updated PIG-3835:
-------------------------------
Description:
PIG-3743 implements union using VertexGroup. Currently, union is a blocking
operator, meaning that a new vertex is added for its succeeding operators.
But if the succeeding vertex contains only a single store, MROutput could be
attached directly to the VertexGroup instead of adding a new vertex for it.
Each union source vertex would then write directly to the destination, which
should be faster because the extra vertex is skipped.
was:
PIG-3742 implements union using VertexGroup. Currently, union is a blocking
operator meaning that a new vertex is added for its succeeding operators.
But if there is only one store in the succeeding vertex, MROutput could be
directly attached to VertexGroup instead of adding a new vertex for it. Then,
each union source vertex will write directly to the destination, and therefore,
it will be faster.
> Optimize union followed by store
> --------------------------------
>
> Key: PIG-3835
> URL: https://issues.apache.org/jira/browse/PIG-3835
> Project: Pig
> Issue Type: Sub-task
> Components: tez
> Affects Versions: tez-branch
> Reporter: Cheolsoo Park
> Fix For: tez-branch
>
>
> PIG-3743 implements union using VertexGroup. Currently, union is a blocking
> operator, meaning that a new vertex is added for its succeeding operators.
> But if the succeeding vertex contains only a single store, MROutput could be
> attached directly to the VertexGroup instead of adding a new vertex for it.
> Each union source vertex would then write directly to the destination, which
> should be faster because the extra vertex is skipped.
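
The proposed rewrite can be illustrated with a small toy model. This is only a sketch of the idea, not code from Pig or Tez: `Vertex`, `UnionPlan`, `planOf`, and `optimize` are hypothetical names, and operators are represented as plain strings. The sketch assumes the rule is "fold the store into the vertex group only when the vertex after the union contains nothing but a single store".

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Toy model only: Vertex, UnionPlan, planOf and optimize are hypothetical names,
// not classes or methods from Pig or Tez.
public class UnionStoreSketch {

    /** A vertex with an ordered list of operator names, e.g. "filter", "store". */
    static final class Vertex {
        final String name;
        final List<String> operators;

        Vertex(String name, List<String> operators) {
            this.name = name;
            this.operators = new ArrayList<>(operators);
        }
    }

    /** Union source vertices (the vertex group) plus the vertex that follows the union. */
    static final class UnionPlan {
        final List<Vertex> sources = new ArrayList<>();
        Vertex successor;        // null once the store has been folded into the group
        boolean groupHasOutput;  // true when the output is attached to the group itself

        /** Number of vertices that would actually run. */
        int vertexCount() {
            return sources.size() + (successor == null ? 0 : 1);
        }
    }

    /** Builds a plan with the given number of sources and one vertex after the union. */
    static UnionPlan planOf(int sourceCount, String... successorOps) {
        UnionPlan plan = new UnionPlan();
        for (int i = 0; i < sourceCount; i++) {
            plan.sources.add(new Vertex("source" + i, Arrays.asList("load")));
        }
        plan.successor = new Vertex("afterUnion", Arrays.asList(successorOps));
        return plan;
    }

    /** Folds a store-only successor into the group; returns true if the plan changed. */
    static boolean optimize(UnionPlan plan) {
        Vertex next = plan.successor;
        boolean storeOnly = next != null
                && next.operators.size() == 1
                && "store".equals(next.operators.get(0));
        if (!storeOnly) {
            return false;  // other operators follow the union, so keep the extra vertex
        }
        plan.groupHasOutput = true;  // sources now write to the destination directly
        plan.successor = null;       // the extra vertex is no longer needed
        return true;
    }

    public static void main(String[] args) {
        UnionPlan plan = planOf(2, "store");
        System.out.println("before: " + plan.vertexCount() + " vertices");
        optimize(plan);
        System.out.println("after: " + plan.vertexCount() + " vertices");
    }
}
```

With two union sources and a store-only successor, the model goes from three vertices to two. A successor that also holds another operator (for example a filter before the store) is left unchanged.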