[ 
https://issues.apache.org/jira/browse/TEZ-1190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14635661#comment-14635661
 ] 

Rohini Palaniswamy edited comment on TEZ-1190 at 7/21/15 7:21 PM:
------------------------------------------------------------------

[~bikassaha],
    The classes are UnionOptimizer and TezDAGBuilder before PIG-4495 went in. 
The code is not straightforward tez code though. 

For eg:
A = LOAD 'data';
SPLIT A into B if $0 > 5, C if $0 > 10 and $0 < 12; // This is just for an 
example. This simple condition can be written with just FILTER instead of split 
and union.
D = UNION B, C;
E = GROUP D by $1;

The pig plan would be like: V1(Load) - > VG1, V1(Load) - >VG1, VG1 - >V2 (Group 
by)

  Vertex group VG1 takes two inputs from same source vertex V1. The output 
vertex is only one i.e V2.


was (Author: rohini):
[~bikassaha],
    The classes are UnionOptimizer and TezDAGBuilder before PIG-4495 went in. 
The code is not straightforward tez code though. 

For eg:
A = LOAD 'data';
SPLIT A into B if $0 > 5, C if $0 > 10 and $0 < 12; // This is just for an 
example. This simple condition can be written with just FILTER instead of split 
and union.
D = UNION B, C;
E = GROUP D by $1;

The pig plan would be like - V1(Load) -> VG1, V1(Load)->VG1, VG1->V2 (Group by)

  Vertex group VG1 takes two inputs from same source vertex V1. The output 
vertex is only one i.e V2.

> Allow multiple edges between two vertexes
> -----------------------------------------
>
>                 Key: TEZ-1190
>                 URL: https://issues.apache.org/jira/browse/TEZ-1190
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Daniel Dai
>
> This will be helpful in some scenario. In particular example, we can merge 
> two small pipelines together in one pair of vertex. Note it is possible the 
> edge type between the two vertexes are different.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to