Koji Noguchi created PIG-5370:
---------------------------------

             Summary: Union onschema + columnprune dropping used fields 
                 Key: PIG-5370
                 URL: https://issues.apache.org/jira/browse/PIG-5370
             Project: Pig
          Issue Type: Task
            Reporter: Koji Noguchi
            Assignee: Koji Noguchi


After PIG-5312, below query started failing.
{code}
A = load 'input.txt' as (a1:int, a2:chararray, a3:int);
B = FOREACH (GROUP A by (a1,a2)) {
    A_FOREACH = FOREACH A GENERATE a2,a3;
    GENERATE A, FLATTEN(A_FOREACH) as (a2,a3);
}
C = load 'input2.txt' as (A:bag{tuple:(a1: int,a2: chararray,a3:int)},a2: 
chararray,a3:int);

D = UNION ONSCHEMA B, C;

dump D;
{code}

{code:title=input1.txt}
1       a       3
2       b       4
2       c       5
1       a       6
2       b       7
1       c       8
{code}

{code:title=input2.txt}
{(10,a0,30),(20,b0,40)} zzz     222
{code}
{noformat:title=Expected output}
({(10,a0,30),(20,b0,40)},zzz,222)
({(1,a,6),(1,a,3)},a,6)
({(1,a,6),(1,a,3)},a,3)
({(1,c,8)},c,8)
({(2,b,7),(2,b,4)},b,7)
({(2,b,7),(2,b,4)},b,4)
({(2,c,5)},c,5)
{noformat}
{noformat:title=Actual (incorrect) output}
({(10,a0,30),(20,b0,40)})    ****ONLY 1 Field ****
({(1,a,6),(1,a,3)},a,6)
({(1,a,6),(1,a,3)},a,3)
({(1,c,8)},c,8)
({(2,b,7),(2,b,4)},b,7)
({(2,b,7),(2,b,4)},b,4)
({(2,c,5)},c,5)
{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to