[ https://issues.apache.org/jira/browse/PIG-429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alan Gates reassigned PIG-429: ------------------------------ Assignee: Pradeep Kamath > Self join wth implicit split has the join output in wrong order > --------------------------------------------------------------- > > Key: PIG-429 > URL: https://issues.apache.org/jira/browse/PIG-429 > Project: Pig > Issue Type: Bug > Affects Versions: 0.2.0 > Reporter: Pradeep Kamath > Assignee: Pradeep Kamath > Fix For: 0.2.0 > > Attachments: PIG-429.patch > > > Query: > {code} > A = load 'st10k' split by 'file'; > B = filter A by $1 > 25; > D = join A by $0, B by $0; > dump D; > {code} > In the output the columns from B are projected out first and from A next. On > closer examination of the code, the ImplicitSplitInserter class adds in the > split and two splitoutput operators into the plan and tries the connect the > successors of LOad to these. However it does this by iterating over its > successors and disconnecting from them and connecting up the > split-splitoutput to the successors. However the order in which it gets its > successors is NOT the same as the order in which cogroup (join) expects its > inputs. Hence the discrepancy. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.