-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/16313/
-----------------------------------------------------------

Review request for pig, Alex Bain, Daniel Dai, Mark Wagner, and Rohini 
Palaniswamy.


Bugs: PIG-3604
    https://issues.apache.org/jira/browse/PIG-3604


Repository: pig-git


Description
-------

Implemented replicated join in Tez as follows:
- POFRJoinTez extends POFRJoin. The difference between two is that replication 
hash table is constructed out of broadcasting edges in Tez instead of files on 
distributed cache in MR.
- TezCompiler adds a vertex per replicated table and connect it to POFRJoin 
vertex via broadcasting edge.

Note that in POLocalRerrangeTez, I package tuples in the same way for broadcast 
and scatter/gather edges, so I removed outputType (DataMovementType). 


Diffs
-----

  
src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POFRJoin.java
 d7c54d8 
  
src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POMergeJoin.java
 e900751 
  src/org/apache/pig/backend/hadoop/executionengine/tez/POFRJoinTez.java 
e69de29 
  
src/org/apache/pig/backend/hadoop/executionengine/tez/POLocalRearrangeTez.java 
cda5d89 
  src/org/apache/pig/backend/hadoop/executionengine/tez/PigProcessor.java 
7a1736a 
  src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java 
2584501 
  test/e2e/pig/tests/tez.conf b280698 
  test/org/apache/pig/test/data/GoldenFiles/TEZC10.gld e69de29 
  test/org/apache/pig/tez/TestTezCompiler.java 79dc94e 

Diff: https://reviews.apache.org/r/16313/diff/


Testing
-------

Added a unit test case to TestTezCompiler.
Added a e2e test case to Join.

ant test-tez passes.
e2e test passes.


Thanks,

Cheolsoo Park

Reply via email to