[
https://issues.apache.org/jira/browse/PIG-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14615168#comment-14615168
]
Rohini Palaniswamy commented on PIG-4515:
-----------------------------------------
bq. What is the status of this bug? It is still present in 0.15.0!
Missed reviewing it as it was not marked "Patch Available" and also did not
have a Fix Version. You need to click on "Submit Patch" after posting a patch.
Could you do that and also set version to 0.16? Also as I mentioned earlier,
this bug is not a blocker and required for 0.15 as there are 3 different ways
of writing the script which is normally used by folks to achieve the result.
builtin.Distinct was mainly used internally for combiner optimization and I
have not seen it being used by many generally. They use the DISTINCT operator.
> org.apache.pig.builtin.Distinct throws ClassCastException
> ---------------------------------------------------------
>
> Key: PIG-4515
> URL: https://issues.apache.org/jira/browse/PIG-4515
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.14.0
> Environment: 2015-04-23 08:37:49,117 [main] INFO org.apache.pig.Main
> - Apache Pig version 0.14.0 (r1640057) compiled Nov 16 2014, 18:02:05
> Reporter: Mikko Kupsu
> Attachments: fix_singletuplebag_classcast_exception.patch,
> fix_singletuplebag_classcast_exception_2.patch
>
>
> Running below script causes *ClassCastException*.
> {code}
> A = LOAD 'A' AS (a:int, b:int);
> B = GROUP A BY a;
> C = FOREACH B GENERATE Distinct(A);
> DUMP C;
> {code}
> Content of A:
> {code}
> 1 1
> 2 1
> 3 1
> 4 1
> 5 2
> 6 2
> 7 2
> 8 2
> 9 2
> {code}
> {code}
> Caused by: java.lang.ClassCastException: org.apache.pig.data.SingleTupleBag
> cannot be cast to org.apache.pig.data.Tuple
> at org.apache.pig.builtin.Distinct$Initial.exec(Distinct.java:86)
> at org.apache.pig.builtin.Distinct$Initial.exec(Distinct.java:78)
> at
> org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOperators.POUserFunc.getNext(POUserFunc.java:323)
> at
> org.apache.pig.backend.hadoop.executionengine.physicalLayer.expressionOperators.POUserFunc.getNextTuple(POUserFunc.java:362)
> at
> org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.getNext(PhysicalOperator.java:361)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)