[ 
https://issues.apache.org/jira/browse/PIG-3100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13537290#comment-13537290
 ] 

Jonathan Coveney commented on PIG-3100:
---------------------------------------

Hah, I didn't see that ticket. Whoops!

I think it is risky to be too clever about trying to figure out delimiters and 
whatnot, but I can see why we might want to be. Hmm. Perhaps there could be a 
flag or setting for PigStorage that enables a "strict" mode, i.e. throw an error 
if a record does not conform to the schema. If it's not set, pad with nulls (or 
cut off extra columns). I think that might be a good compromise?
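
To make the proposal concrete, here is a rough, standalone sketch of the behavior I have in mind. It is not the patch and not PigStorage code: the class name SchemaConform, the method conform, and the strict parameter are all made up for illustration, and a plain List stands in for a tuple.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SchemaConform {
    /**
     * Hypothetical helper (not part of Pig). Makes a record match the
     * number of fields the schema expects.
     *
     * strict == true : throw if the field count differs from the schema.
     * strict == false: pad short records with nulls, cut off extra columns.
     */
    public static List<Object> conform(List<Object> fields, int expected, boolean strict) {
        if (fields.size() == expected) {
            return new ArrayList<>(fields);
        }
        if (strict) {
            throw new IllegalArgumentException(
                "Record has " + fields.size() + " fields, schema expects " + expected);
        }
        List<Object> out = new ArrayList<>(expected);
        for (int i = 0; i < expected; i++) {
            // Copy what the record has; fill the remainder with nulls.
            out.add(i < fields.size() ? fields.get(i) : null);
        }
        return out;
    }

    public static void main(String[] args) {
        // Short record: padded with a null.
        System.out.println(conform(Arrays.<Object>asList("a", "b"), 3, false));
        // Long record: the extra column is dropped.
        System.out.println(conform(Arrays.<Object>asList("a", "b", "c", "d"), 3, false));
    }
}
```

With strict set to false the two calls print [a, b, null] and [a, b, c]; with strict set to true both would throw instead of hitting an index out of bounds later on.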
                
> If a .pig_schema file is present, can get an index out of bounds error
> ----------------------------------------------------------------------
>
>                 Key: PIG-3100
>                 URL: https://issues.apache.org/jira/browse/PIG-3100
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Jonathan Coveney
>            Assignee: Jonathan Coveney
>             Fix For: 0.12
>
>         Attachments: PIG-3100-0_nows.patch, PIG-3100-0.patch
>
>
> In the case that a .pig_schema file is present, if you have a record with 
> fewer than expected fields, pig errors out with an index out of bounds 
> exception that is annoying, unnecessary, and unhelpful.
> Instead of improving logging, I decided to just do what pig should do, which 
> is fill in the records.
> Patch will include a test and the fix.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira