Koji Noguchi created PIG-5370: --------------------------------- Summary: Union onschema + columnprune dropping used fields Key: PIG-5370 URL: https://issues.apache.org/jira/browse/PIG-5370 Project: Pig Issue Type: Task Reporter: Koji Noguchi Assignee: Koji Noguchi
After PIG-5312, below query started failing. {code} A = load 'input.txt' as (a1:int, a2:chararray, a3:int); B = FOREACH (GROUP A by (a1,a2)) { A_FOREACH = FOREACH A GENERATE a2,a3; GENERATE A, FLATTEN(A_FOREACH) as (a2,a3); } C = load 'input2.txt' as (A:bag{tuple:(a1: int,a2: chararray,a3:int)},a2: chararray,a3:int); D = UNION ONSCHEMA B, C; dump D; {code} {code:title=input1.txt} 1 a 3 2 b 4 2 c 5 1 a 6 2 b 7 1 c 8 {code} {code:title=input2.txt} {(10,a0,30),(20,b0,40)} zzz 222 {code} {noformat:title=Expected output} ({(10,a0,30),(20,b0,40)},zzz,222) ({(1,a,6),(1,a,3)},a,6) ({(1,a,6),(1,a,3)},a,3) ({(1,c,8)},c,8) ({(2,b,7),(2,b,4)},b,7) ({(2,b,7),(2,b,4)},b,4) ({(2,c,5)},c,5) {noformat} {noformat:title=Actual (incorrect) output} ({(10,a0,30),(20,b0,40)}) ****ONLY 1 Field **** ({(1,a,6),(1,a,3)},a,6) ({(1,a,6),(1,a,3)},a,3) ({(1,c,8)},c,8) ({(2,b,7),(2,b,4)},b,7) ({(2,b,7),(2,b,4)},b,4) ({(2,c,5)},c,5) {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)