[ 
https://issues.apache.org/jira/browse/PIG-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017347#comment-13017347
 ] 

Mridul Muralidharan commented on PIG-1627:
------------------------------------------

bytearray vs unknown schema use is always confusing.
The description in https://issues.apache.org/jira/browse/PIG-1876, for example, 
indicates that unknown schema implies it should be bytearray (desc starts with 
: "Currently Pig map type is untyped, which means map value is always of 
bytearray(ie. unknown) type." ..), while this JIRA seems to indicate it is not 
the case !

I have seen varying interpretations of what bytearray is supposed to mean in 
the jira's, pig docs and pig source code over the last 3+ years, not to mention 
in the various ilist's and user source codebass - some clarity in this regard 
would be good and less confusing.

> Flattening of bags with unknown schemas produces wrong schema
> -------------------------------------------------------------
>
>                 Key: PIG-1627
>                 URL: https://issues.apache.org/jira/browse/PIG-1627
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.7.0
>            Reporter: Alan Gates
>            Assignee: Daniel Dai
>             Fix For: 0.9.0
>
>
> The following should produce an unknown schema:
> {code}
> A = load '/Users/gates/test/data/studenttab10';
> B = group A by $0;
> C = foreach B generate flatten(A);
> describe C;
> {code}
> Instead it gives
> {code}
> C: {bytearray}
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to