[ https://issues.apache.org/jira/browse/PIG-5224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16024320#comment-16024320 ]
Daniel Dai commented on PIG-5224:
---------------------------------
The inserted LOForEach removes all the columns that are not used later in
the script. The next LOForEach does not necessarily do that. I believe this
is not for performance reasons (the performance gain from removing several
columns may be debatable); it is to make ColumnPruner simpler.
> Extra foreach from ColumnPrune preventing Accumulator usage
> -----------------------------------------------------------
>
> Key: PIG-5224
> URL: https://issues.apache.org/jira/browse/PIG-5224
> Project: Pig
> Issue Type: Improvement
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Attachments: pig-5224-v0-testonly.patch, pig-5224-v1.patch
>
>
> {code}
> A = load 'input' as (id:int, fruit);
> B = foreach A generate id; -- to enable columnprune
> C = group B by id;
> D = foreach C {
>     o = order B by id;
>     generate org.apache.pig.test.utils.AccumulatorBagCount(o);
> }
> STORE D into ...
> {code}
> Pig fails to use Accumulator interface for this UDF.