[
https://issues.apache.org/jira/browse/PIG-3295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Koji Noguchi updated PIG-3295:
------------------------------
Attachment: pig-3295-v06.patch
bq. but now I do see that my last patch added many failures in
TestTypeCheckingValidatorNewLP.
Looking at the test, I see that it's checking how the Loader info propagate or
won't propagate. It's using
* PigStorage('a')
and
* PigStorage('b')
to represent two different Loader. However, with my patch, these are now
considered equal since they both use the Utf8StorageConverter.
In most of the testcases, I replaced PigStorage('b') to
org.apache.pig.test.PigStorageWithDifferentCaster('b') so that testcase would
have two distinct loadcasters.
For testcase testCogroupStarLineageFail and testCogroupStarLineageNoSchemaFail,
I kept the PigStorage since the exception is thrown even with a single loader.
For testCogroupStreamingLineageNoSchema, I changed the expectedResult since now
PigStorage and PigStreaming both uses the same loadcaster, Utf8StorageConverter.
> Casting from bytearray failing after Union (even when each field is from a
> single Loader)
> -----------------------------------------------------------------------------------------
>
> Key: PIG-3295
> URL: https://issues.apache.org/jira/browse/PIG-3295
> Project: Pig
> Issue Type: Bug
> Components: parser
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Priority: Minor
> Attachments: pig-3295-v01.patch, pig-3295-v02.patch,
> pig-3295-v03.patch, pig-3295-v04.patch, pig-3295-v05.patch, pig-3295-v06.patch
>
>
> One example
> {noformat}
> A = load 'data1.txt' as line:bytearray;
> B = load 'c1.txt' using TextLoader() as cookie1;
> C = load 'c2.txt' using TextLoader() as cookie2;
> B2 = join A by line, B by cookie1;
> C2 = join A by line, C by cookie2;
> D = union onschema B2,C2; -- D: {A::line: bytearray,B::cookie1:
> bytearray,C::cookie2: bytearray}
> E = foreach D generate (chararray) line, (chararray) cookie1, (chararray)
> cookie2;
> dump E;
> {noformat}
> This script fails at runtime with
> "Caused by: org.apache.pig.backend.executionengine.ExecException: ERROR 1075:
> Received a bytearray from the UDF. Cannot determine how to convert the
> bytearray to string."
> This is different from PIG-3293 such that each field in 'D' belongs to a
> single loader whereas on PIG-3293, it came from multiple loader.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira