[ https://issues.apache.org/jira/browse/PIG-592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Christopher Olston updated PIG-592: ----------------------------------- Description: A simple pig script, that never introduces any schema information: A = load 'foo'; B = foreach (group A by $8) generate group, COUNT($1); C = load 'bar'; // ('bar' has two columns) D = join B by $0, C by $0; E = foreach D generate $0, $1, $3; Fails, complaining that $3 does not exist: java.io.IOException: Out of bound access. Trying to access non-existent column: 3. Schema {B::group: bytearray,long,bytearray} has 3 column(s). Apparently Pig gets confused, and thinks it knows the schema for C (a single bytearray column). was: A simple pig script, that never introduces any schema information: A = load 'foo'; B = foreach (group A by $8) generate group, COUNT($1); C = load 'bar'; // ('bar' has two columns) D = join B by $0, C by $0; E = foreach D generate $0, $1, $3; Fails, complaining that $3 does not exist: java.io.IOException: Out of bound access. Trying to access non-existent column: 3. Schema {ID2::group: bytearray,long,bytearray} has 3 column(s). Apparently Pig gets confused, and thinks it knows the schema for C (a single bytearray column). > schema inferred incorrectly > --------------------------- > > Key: PIG-592 > URL: https://issues.apache.org/jira/browse/PIG-592 > Project: Pig > Issue Type: Bug > Affects Versions: types_branch > Reporter: Christopher Olston > > A simple pig script, that never introduces any schema information: > A = load 'foo'; > B = foreach (group A by $8) generate group, COUNT($1); > C = load 'bar'; // ('bar' has two columns) > D = join B by $0, C by $0; > E = foreach D generate $0, $1, $3; > Fails, complaining that $3 does not exist: > java.io.IOException: Out of bound access. Trying to access non-existent > column: 3. Schema {B::group: bytearray,long,bytearray} has 3 column(s). > Apparently Pig gets confused, and thinks it knows the schema for C (a single > bytearray column). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.