PruneColumns optimization does not handle the case of foreach flatten correctly 
if flattened bag is not used later
------------------------------------------------------------------------------------------------------------------

                 Key: PIG-1184
                 URL: https://issues.apache.org/jira/browse/PIG-1184
             Project: Pig
          Issue Type: Bug
    Affects Versions: 0.6.0
            Reporter: Pradeep Kamath


The following script :
{noformat}
-e "a = load 'input.txt' as (f1:chararray, f2:chararray, 
f3:bag{t:tuple(id:chararray)}, f4:bag{t:tuple(loc:chararray)}); b = foreach a 
generate f1, f2, flatten(f3), flatten(f4), 10; b = foreach b generate f1, f2, 
\$4; dump b;"
{noformat}
gives the following result:
(oiue,M,10)

{noformat}
cat input.txt:
oiue    M       {(3),(4)}       {(toronto),(montreal)}
{noformat}

If PruneColumns optimizations is disabled, we get the right result:
(oiue,M,10)
(oiue,M,10)
(oiue,M,10)
(oiue,M,10)

The flatten results in 4 records - so the output should contain 4 records.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to