[
https://issues.apache.org/jira/browse/PIG-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038138#comment-14038138
]
Cheolsoo Park commented on PIG-4018:
------------------------------------
[~tmwoodruff], I ran the unit tests and found the following failures-
{code}
>>> org.apache.pig.test.TestUnionOnSchema.testUnionOnSchemaUdfTypeEvolution2
>>> org.apache.pig.test.TestUnionOnSchema.testUnionOnSchemaUdfTypeEvolution
>>> org.apache.pig.test.TestUnionOnSchema.testUnionOnSchemaIncompatibleTypes
{code}
Can you take a look at them?
> Schema validation fails with UNION ONSCHEMA
> -------------------------------------------
>
> Key: PIG-4018
> URL: https://issues.apache.org/jira/browse/PIG-4018
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.13.0
> Reporter: Travis Woodruff
> Assignee: Travis Woodruff
> Attachments: PIG-4018.patch
>
>
> When relations with differing schemas are unioned (using UNION ONSCHEMA),
> schema validation can fail with this exception:
> {{org.apache.pig.impl.plan.PlanValidationException: Logical plan invalid
> state: invalid uid -1 in schema}}
> This worked before the fix for PIG-3492.
> The merged schema (from {{LOUnion.getSchema()}}) does not contain uids for
> columns not in the schema of the first input (uids are set to -1). This is
> because only the first input's schema is used for looking up "cached" uids.
> Normally, this isn't a problem because {{UnionOnSchemaSetter}} comes along
> and fixes the missing fields.
> However, when {{ImplicitSplitInsertVisitor}} is active, it is called before
> {{UnionOnSchemaSetter}}. {{ImplicitSplitInsertVisitor}} calls
> {{schemaResetter.visit()}}, which throws the validation exception because
> {{UnionOnSchemaSetter}} has not had a chance to create the missing fields
> (and thus uids are still -1 for these fields).
--
This message was sent by Atlassian JIRA
(v6.2#6252)