[ 
https://issues.apache.org/jira/browse/PIG-3766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rohini Palaniswamy updated PIG-3766:
------------------------------------

    Attachment: PIG-3766-1.patch

Changes done:
  1) Removed the POLocalRearrange in SampleVertex and replaced it with a 
POValueOutTez for both orderby and skewed join. Since POValueOutTez can write 
to multiple outputs, the POSplit in the skewed join sample vertex is gone as 
well.
  2) Replaced the POPackage+POLocalRearrange in the partition vertex of the 
left table (vertex 3) with a POIdentityInOutTez, moving the project from that 
POLocalRearrange into the POLocalRearrange in vertex 1. Also made the edge 
between vertex 1 and vertex 3 a 1-1 (ONE_TO_ONE) edge.
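
For anyone unfamiliar with the two edge types, the difference can be roughly 
modelled as below. This is only a conceptual Python sketch, not Tez or Pig 
code: the function names scatter_gather and one_to_one and the sample records 
are made up for illustration, and the partition function stands in for 
whatever partitioner the real edge would use.

```python
# Conceptual model only; none of these names exist in Tez or Pig.

def scatter_gather(upstream_outputs, num_downstream, partition):
    """Shuffle-style edge: any upstream task may send to any downstream task."""
    downstream = [[] for _ in range(num_downstream)]
    for records in upstream_outputs:
        for key, value in records:
            # Each record is routed by its key, so data is redistributed.
            downstream[partition(key) % num_downstream].append((key, value))
    # A shuffle edge also delivers each downstream input sorted by key.
    return [sorted(part) for part in downstream]


def one_to_one(upstream_outputs):
    """1-1 edge: upstream task i feeds only downstream task i, unsorted."""
    return [list(records) for records in upstream_outputs]


# Two upstream tasks, each with its own (key, value) records.
upstream = [[("b", 1), ("a", 2)], [("a", 3), ("c", 4)]]

print(one_to_one(upstream))
print(scatter_gather(upstream, 2, partition=lambda k: ord(k)))
```

With the 1-1 edge each downstream task sees exactly what its upstream task 
produced, in the same order, so nothing has to be sorted or redistributed 
before the real partitioning step.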

> Use ONE_TO_ONE edge and IdentityInOut in skewed join intermediate vertex
> ------------------------------------------------------------------------
>
>                 Key: PIG-3766
>                 URL: https://issues.apache.org/jira/browse/PIG-3766
>             Project: Pig
>          Issue Type: Sub-task
>          Components: tez
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: tez-branch
>
>         Attachments: PIG-3766-1.patch
>
>
> From the first vertex (left table load) to the intermediate vertex that 
> partitions the keys based on the SkewedPartitionerTez, use a ONE_TO_ONE Tez 
> edge with unsorted output and input instead of a shuffle edge. Also replace 
> the POPackage->POForEach->POLocalRearrange in the intermediate vertex with 
> POIdentityInOutTez.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
