[ 
https://issues.apache.org/jira/browse/PIG-5341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16500838#comment-16500838
 ] 

Koji Noguchi commented on PIG-5341:
-----------------------------------

Attached {{pig-5341-v01.patch}} that fixes the off-by-one bug in determining 
which fields to keep. 
Also, tagfile/tagpath was always being added to tuple even when columnpruning 
was setting {{mRequiredColumns[0]}} to false.

Separate from this, found {{testColumnPrune}} wasn't really comparing the 
output.  Fixed that as well.

> PigStorage with -tagFile/-tagPath produces incorrect results with column 
> pruning
> --------------------------------------------------------------------------------
>
>                 Key: PIG-5341
>                 URL: https://issues.apache.org/jira/browse/PIG-5341
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Koji Noguchi
>            Assignee: Koji Noguchi
>            Priority: Critical
>         Attachments: pig-5341-v01.patch
>
>
> I don't know why we didn't see this till now.
> {code}
> A = load 'test.txt' using PigStorage('\t', '-tagFile') as 
> (filename:chararray, a0:int, a1:int, a2:int, a3:int);
> B = FOREACH A GENERATE a0,a2;
> dump B;
> {code}
> Input 
> {noformat}
> knoguchi@pig > cat  test.txt
> 0       1       2       3
> 0       1       2       3
> 0       1       2       3
> {noformat}
> Expected Results
> {noformat}
> (0,2)
> (0,2)
> (0,2)
> {noformat}
> Actual Results
> {noformat}
> (,1)
> (,1)
> (,1)
> {noformat}
> This is really bad...



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to