GROUP BY multiple column not working with new optimizer
-------------------------------------------------------
Key: PIG-1523
URL: https://issues.apache.org/jira/browse/PIG-1523
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.7.0
Reporter: Swati Jain
The following script does a GroupBy multiple columns:
{norformat}
A = load '<any file>' USING PigStorage(',') as (a1:int,a2:int,a3:int);
G1 = GROUP A by (a1,a2);
D = Filter G1 by group.$0 > 1;
explain D;
{noformat}
The above fails with the following error when the new optimizer is enabled (it
fails with the old framework too but only when it gets to the execution stage):
{noformat}
Caused by: java.lang.NullPointerException
at
org.apache.pig.experimental.logical.LogicalPlanMigrationVistor$LogicalExpPlanMigrationVistor.visit(LogicalPlanMigrationVistor.java:424)
at org.apache.pig.impl.logicalLayer.LOProject.visit(LOProject.java:404)
at org.apache.pig.impl.logicalLayer.LOProject.visit(LOProject.java:58)
at
org.apache.pig.impl.plan.DependencyOrderWalker.walk(DependencyOrderWalker.java:69)
at
org.apache.pig.experimental.logical.LogicalPlanMigrationVistor.translateExpressionPlan(LogicalPlanMigrationVistor.java:155)
at
org.apache.pig.experimental.logical.LogicalPlanMigrationVistor.visit(LogicalPlanMigrationVistor.java:295)
at org.apache.pig.impl.logicalLayer.LOFilter.visit(LOFilter.java:116)
at org.apache.pig.impl.logicalLayer.LOFilter.visit(LOFilter.java:41)
at
org.apache.pig.impl.plan.DependencyOrderWalker.walk(DependencyOrderWalker.java:69)
at org.apache.pig.impl.plan.PlanVisitor.visit(PlanVisitor.java:51)
at
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.compile(HExecutionEngine.java:237)
{noformat}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.