[jira] Commented: (PIG-446) Problem when schema contains more columns that actual data

Olga Natkovich (JIRA) Tue, 23 Sep 2008 09:12:05 -0700

    [ 
https://issues.apache.org/jira/browse/PIG-446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12633781#action_12633781
 ]


Olga Natkovich commented on PIG-446:
------------------------------------

+1. My only concer is that if there is enough data like this we will be logging 
a lot. Buit we already doing this with UDFs so at least this is consistent. 
Once we address logging we could aggregate warnings.

> Problem when schema contains more columns that actual data
> ----------------------------------------------------------
>
>                 Key: PIG-446
>                 URL: https://issues.apache.org/jira/browse/PIG-446
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: types_branch
>            Reporter: Olga Natkovich
>            Assignee: Alan Gates
>             Fix For: types_branch
>
>         Attachments: PIG-446.patch
>
>
> Script:
> -- extra1 and extra2 are not present in the data
> A = load 'data' as (name, age: int, gpa, extra1, extra2);
> B = limit A 10;
> dump B;
> Error:
>  Out of bounds access: Request for field number 3 exceeds tuple size of 3
>         at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapReduce$Reduce.reduce(PigMapReduce.java:223)
>         at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapReduce$Reduce.reduce(PigMapReduce.java:121)
>         at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:391)
>         at 
> org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2124)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (PIG-446) Problem when schema contains more columns that actual data

Reply via email to