[ 
https://issues.apache.org/jira/browse/PIG-894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12763776#action_12763776
 ] 

Pradeep Kamath commented on PIG-894:
------------------------------------

The patch uses pig.inputs property from jobconf which does not directly have 
the input file name - it actually has a serialized arrayList<Pair<FileSpec, 
Boolean>> in string form containing the filespec and the issplittable flag for 
each input for the job - this serialized string will need to be deserialized 
using ObjectSerializer.deserialize and then from the filespec, the filename 
will need to be retrieved.

> order-by fails when input is empty
> ----------------------------------
>
>                 Key: PIG-894
>                 URL: https://issues.apache.org/jira/browse/PIG-894
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Thejas M Nair
>            Assignee: Daniel Dai
>         Attachments: PIG-894-1.patch
>
>
> grunt> l = load 'students.txt' ;
> grunt> f = filter l by 1 == 2;
> grunt> o = order f by $0 ;
> grunt> dump o;
> This results in 3 MR jobs . The 2nd (sampling) MR creates empty sample file, 
> and 3rd MR (order-by) fails with following error in Map job -
> java.lang.RuntimeException: java.lang.RuntimeException: Empty samples file
>       at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.configure(WeightedRangePartitioner.java:104)
>       at 
> org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:58)
>       at 
> org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:82)
>       at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.(MapTask.java:348)
>       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:193)
>       at 
> org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2207)
> Caused by: java.lang.RuntimeException: Empty samples file
>       at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.partitioners.WeightedRangePartitioner.configure(WeightedRangePartitioner.java:89)
>       ... 5 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to