GROUP BY multiple column not working with new optimizer
-------------------------------------------------------

                 Key: PIG-1523
                 URL: https://issues.apache.org/jira/browse/PIG-1523
             Project: Pig
          Issue Type: Bug
          Components: impl
    Affects Versions: 0.7.0
            Reporter: Swati Jain


The following script does a GroupBy multiple columns:
{norformat}
A = load '<any file>' USING PigStorage(',') as (a1:int,a2:int,a3:int);
G1 = GROUP A by (a1,a2);
D = Filter G1 by group.$0 > 1;
explain D;
{noformat}

The above fails with the following error when the new optimizer is enabled (it 
fails with the old framework too but only when it gets to the execution stage):
{noformat}
Caused by: java.lang.NullPointerException
        at 
org.apache.pig.experimental.logical.LogicalPlanMigrationVistor$LogicalExpPlanMigrationVistor.visit(LogicalPlanMigrationVistor.java:424)
        at org.apache.pig.impl.logicalLayer.LOProject.visit(LOProject.java:404)
        at org.apache.pig.impl.logicalLayer.LOProject.visit(LOProject.java:58)
        at 
org.apache.pig.impl.plan.DependencyOrderWalker.walk(DependencyOrderWalker.java:69)
        at 
org.apache.pig.experimental.logical.LogicalPlanMigrationVistor.translateExpressionPlan(LogicalPlanMigrationVistor.java:155)
        at 
org.apache.pig.experimental.logical.LogicalPlanMigrationVistor.visit(LogicalPlanMigrationVistor.java:295)
        at org.apache.pig.impl.logicalLayer.LOFilter.visit(LOFilter.java:116)
        at org.apache.pig.impl.logicalLayer.LOFilter.visit(LOFilter.java:41)
        at 
org.apache.pig.impl.plan.DependencyOrderWalker.walk(DependencyOrderWalker.java:69)
        at org.apache.pig.impl.plan.PlanVisitor.visit(PlanVisitor.java:51)
        at 
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.compile(HExecutionEngine.java:237)
{noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to