PruneColumns optimization does not handle the case of foreach flatten correctly if flattened bag is not used later
------------------------------------------------------------------------------------------------------------------
Key: PIG-1184
URL: https://issues.apache.org/jira/browse/PIG-1184
Project: Pig
Issue Type: Bug
Affects Versions: 0.6.0
Reporter: Pradeep Kamath

The following script:
{noformat}
-e "a = load 'input.txt' as (f1:chararray, f2:chararray, f3:bag{t:tuple(id:chararray)}, f4:bag{t:tuple(loc:chararray)});
b = foreach a generate f1, f2, flatten(f3), flatten(f4), 10;
b = foreach b generate f1, f2, \$4;
dump b;"
{noformat}
gives the following result:
(oiue,M,10)

{noformat}
cat input.txt:
oiue M {(3),(4)} {(toronto),(montreal)}
{noformat}

If the PruneColumns optimization is disabled, we get the right result:
(oiue,M,10)
(oiue,M,10)
(oiue,M,10)
(oiue,M,10)

Flattening f3 and f4, which hold two tuples each, produces 2 x 2 = 4 records, so the output should contain 4 records even though the flattened columns are not used later.
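To make the expected record count concrete, here is a rough Python model of the flatten semantics described in this report. It is only an illustration and not Pig's implementation: the helper name flatten_bags is hypothetical, bags are modelled as lists of tuples, and the constant 10 from the generate clause is modelled as a trailing field of the input record.

```python
from itertools import product


def flatten_bags(record, bag_indexes):
    """Model of 'foreach ... generate flatten(...)' on one record.

    Hypothetical helper: emits one row per combination of tuples drawn
    from the bags at bag_indexes (which must be in ascending order) and
    copies every other field unchanged.
    """
    bags = [record[i] for i in bag_indexes]
    rows = []
    for combo in product(*bags):
        picked = iter(combo)
        row = []
        for i, field in enumerate(record):
            if i in bag_indexes:
                # Replace the bag with the fields of the chosen tuple.
                row.extend(next(picked))
            else:
                row.append(field)
        rows.append(tuple(row))
    return rows


# The single input record from input.txt, plus the constant 10.
record = ("oiue", "M", [("3",), ("4",)], [("toronto",), ("montreal",)], 10)

flattened = flatten_bags(record, (2, 3))

# Second foreach: keep f1, f2 and $4 (the constant), dropping the
# flattened columns. The row count must not change.
projected = [(r[0], r[1], r[4]) for r in flattened]

for row in projected:
    print(row)
```

In this model the projection still yields four rows, which matches the output seen with PruneColumns disabled: dropping the flattened columns afterwards does not undo the row multiplication that flatten already caused.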