[ 
https://issues.apache.org/jira/browse/PIG-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14615168#comment-14615168
 ] 

Rohini Palaniswamy commented on PIG-4515:
-----------------------------------------

bq. What is the status of this bug? It is still present in 0.15.0!
   Missed reviewing it as it was not marked "Patch Available" and also did not 
have a Fix Version. You need to click on "Submit Patch" after posting a patch.  
Could you do that and also set version to 0.16?  Also as I mentioned earlier, 
this bug is not a blocker and required for 0.15 as there are 3 different ways 
of writing the script which is normally used by folks to achieve the result. 
builtin.Distinct was mainly used internally for combiner optimization and I 
have not seen it being used by many generally. They use the DISTINCT operator.

> org.apache.pig.builtin.Distinct throws ClassCastException
> ---------------------------------------------------------
>
>                 Key: PIG-4515
>                 URL: https://issues.apache.org/jira/browse/PIG-4515
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.14.0
>         Environment: 2015-04-23 08:37:49,117 [main] INFO  org.apache.pig.Main 
> - Apache Pig version 0.14.0 (r1640057) compiled Nov 16 2014, 18:02:05
>            Reporter: Mikko Kupsu
>         Attachments: fix_singletuplebag_classcast_exception.patch, 
> fix_singletuplebag_classcast_exception_2.patch
>
>
> Running below script causes *ClassCastException*.
> {code}
> A = LOAD 'A' AS (a:int, b:int);
> B = GROUP A BY a;
> C = FOREACH B GENERATE Distinct(A);
> DUMP C;
> {code}
> Content of A:
> {code}
> 1     1
> 2     1
> 3     1
> 4     1
> 5     2
> 6     2
> 7     2
> 8     2
> 9     2
> {code}
> {code}
> Caused by: java.lang.ClassCastException: org.apache.pig.data.SingleTupleBag 
> cannot be cast to org.apache.pig.data.Tuple
>       at org.apache.pig.builtin.Distinct$Initial.exec(Distinct.java:86)
>       at org.apache.pig.builtin.Distinct$Initial.exec(Distinct.java:78)
>       at 
> org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOperators.POUserFunc.getNext(POUserFunc.java:323)
>       at 
> org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOperators.POUserFunc.getNextTuple(POUserFunc.java:362)
>       at 
> org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.getNext(PhysicalOperator.java:361)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to