-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/16463/
-----------------------------------------------------------

Review request for pig, Alex Bain, Daniel Dai, Mark Wagner, and Rohini Palaniswamy.


Bugs: PIG-3636
    https://issues.apache.org/jira/browse/PIG-3636


Repository: pig-git


Description
-------

The patch implements accumulator optimization in Tez. The changes include:
* Create AccumulatorOptimizer in Tez.
* Create AccumulatorOptimizerUtil class and factor out the functions common to MR and Tez.
* Implement accumulator logic in POShuffleTezLoad.
* Update TestAccumulator to make it run in Tez mode.


Diffs
-----

  src/org/apache/pig/PigConfiguration.java 0a26e8c
  src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/AccumulatorOptimizer.java 7f9e15a
  src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MapReduceLauncher.java 9eed25c
  src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POPackage.java 6e04513
  src/org/apache/pig/backend/hadoop/executionengine/tez/AccumulatorOptimizer.java e69de29
  src/org/apache/pig/backend/hadoop/executionengine/tez/POShuffleTezLoad.java 722b9f6
  src/org/apache/pig/backend/hadoop/executionengine/tez/POUnionTezLoad.java 742a33a
  src/org/apache/pig/backend/hadoop/executionengine/tez/TezDagBuilder.java d42ce89
  src/org/apache/pig/backend/hadoop/executionengine/tez/TezLauncher.java c6af682
  src/org/apache/pig/backend/hadoop/executionengine/tez/TezPlanContainer.java e33a7c6
  src/org/apache/pig/backend/hadoop/executionengine/util/AccumulatorOptimizerUtil.java e69de29
  test/org/apache/pig/test/TestAccumulator.java b979649
  test/org/apache/pig/test/TestCombiner.java a227d18
  test/tez-tests fcb573e

Diff: https://reviews.apache.org/r/16463/diff/


Testing
-------

* TestAccumulator passes in Tez mode.
* All unit tests pass.
* All e2e tests pass.

Note that 3 test cases in TestAccumulator are annotated as @Ignore because SecondaryKeyOptimizer in Tez is not implemented yet.
These test cases expect the accumulator optimizer to be applied when order-by and distinct are present in a nested foreach, because these operators are supposed to be removed by SecondaryKeyOptimizer. Added TODO comments accordingly.


Thanks,

Cheolsoo Park