[
https://issues.apache.org/jira/browse/PIG-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13553545#comment-13553545
]
Eli Finkelshteyn commented on PIG-2636:
---------------------------------------
Using cogroup instead of join is a work around for this (i.e. just replace
"join" with "cogroup"). You'll need to handle the result set slightly
differently, but anything possible with a join is also possible with a cogroup.
I'm pretty sure the issue here is the same as the one in
https://issues.apache.org/jira/browse/PIG-3020 and is solved for 0.11.
> "duplicate uid in schema" error from script that uses multiple FOREACH on
> same relation
> ---------------------------------------------------------------------------------------
>
> Key: PIG-2636
> URL: https://issues.apache.org/jira/browse/PIG-2636
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.9.0
> Reporter: Pete Gieser
> Priority: Minor
>
> The following script:
> A = LOAD 'bug.in' AS a:tuple(x:int, y:int);
> B1 = FOREACH A GENERATE a.x, a.y;
> B2 = FOREACH A GENERATE a.x, a.y;
> C = JOIN B1 BY x, B2 by x;
> yields the following error:
> org.apache.pig.impl.plan.PlanValidationException: ERROR 2270: Logical plan
> invalid state: duplicate uid in schema :
> B1::x#35:int,B1::y#36:int,B2::x#35:int,B2::y#36:int
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira