[ https://issues.apache.org/jira/browse/PIG-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Dai updated PIG-4808:
----------------------------
Resolution: Fixed
Hadoop Flags: Reviewed
Fix Version/s: 0.15.1, 0.16.0
Status: Resolved (was: Patch Available)
+1.
Committed to both trunk and the 0.15 branch, since this bug leads to wrong results.
Thanks Eyal!
> PluckTuple overwrites regex if used more than once in the same script
> ---------------------------------------------------------------------
>
> Key: PIG-4808
> URL: https://issues.apache.org/jira/browse/PIG-4808
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.15.0
> Reporter: Eyal Allweil
> Assignee: Eyal Allweil
> Fix For: 0.16.0, 0.15.1
>
> Attachments: PIG-4808.patch
>
>
> If you define two PluckTuples with different regexes and use them together,
> each will overwrite the other's regex field (this does not affect cases
> where non-regex prefixes are used).
> Example (from testOutput in TestPluckTuple.java, modified):
> a = load 'a' using mock.Storage();
> b = load 'b' using mock.Storage();
> c = join a by x, b by x;
> define pluck1 PluckTuple('a::.*');
> define pluck2 PluckTuple('b::.*');
> d = foreach c generate pluck1(*) AS pluck1, pluck2(*) AS pluck2;
> Expected:
> ((1,hey,2),(1,sasf,5))
> ((2,woah,3),(2,woah,6))
> Actual:
> ((1,sasf,5),(1,sasf,5))
> ((2,woah,6),(2,woah,6))
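
The symptom above (both columns showing the result of the last-defined regex) is what one would expect if the compiled pattern were stored in state shared across instances. The sketch below is not the Pig source and makes no claim about what PIG-4808.patch actually changed. It is a minimal, self-contained Java illustration, assuming a static field is the culprit. The class names PluckSketch, SharedPatternPlucker and PerInstancePlucker are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

public class PluckSketch {

    // Hypothetical buggy variant: the pattern is static, so every instance
    // shares it and the most recently constructed instance wins.
    static class SharedPatternPlucker {
        private static Pattern pattern;

        SharedPatternPlucker(String regex) {
            pattern = Pattern.compile(regex);
        }

        List<String> pluck(List<String> fieldNames) {
            List<String> out = new ArrayList<>();
            for (String name : fieldNames) {
                if (pattern.matcher(name).matches()) {
                    out.add(name);
                }
            }
            return out;
        }
    }

    // Hypothetical fixed variant: each instance keeps its own pattern.
    static class PerInstancePlucker {
        private final Pattern pattern;

        PerInstancePlucker(String regex) {
            this.pattern = Pattern.compile(regex);
        }

        List<String> pluck(List<String> fieldNames) {
            List<String> out = new ArrayList<>();
            for (String name : fieldNames) {
                if (pattern.matcher(name).matches()) {
                    out.add(name);
                }
            }
            return out;
        }
    }

    public static void main(String[] args) {
        // Field aliases as they might look after a join of relations a and b.
        List<String> fields = Arrays.asList("a::x", "a::y", "b::x", "b::z");

        SharedPatternPlucker s1 = new SharedPatternPlucker("a::.*");
        SharedPatternPlucker s2 = new SharedPatternPlucker("b::.*");
        System.out.println(s1.pluck(fields)); // [b::x, b::z] (overwritten)
        System.out.println(s2.pluck(fields)); // [b::x, b::z]

        PerInstancePlucker p1 = new PerInstancePlucker("a::.*");
        PerInstancePlucker p2 = new PerInstancePlucker("b::.*");
        System.out.println(p1.pluck(fields)); // [a::x, a::y]
        System.out.println(p2.pluck(fields)); // [b::x, b::z]
    }
}
```

In the shared variant, constructing the second plucker replaces the pattern used by the first, which mirrors the "Actual" output in the report. The per-instance variant keeps the two regexes independent, matching the "Expected" output.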