[
https://issues.apache.org/jira/browse/TAJO-2082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15173262#comment-15173262
]
ASF GitHub Bot commented on TAJO-2082:
--------------------------------------
GitHub user jihoonson opened a pull request:
https://github.com/apache/tajo/pull/969
TAJO-2082: Aggregation on a derived table which includes union can cause
incorrect result
I didn't add unit test because it is difficult to reproduce this bug with
unit test.
IMO, it is better to add a verifier to test global plan. I created a jira
ticket for global plan verifier.
(https://issues.apache.org/jira/browse/TAJO-2084)
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jihoonson/tajo-2 TAJO-2082
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tajo/pull/969.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #969
----
commit b47ba9a9016c1bb7f763157c61915491af58c12f
Author: Jihoon Son <[email protected]>
Date: 2016-02-25T08:06:18Z
Add shuffle info to master plan
commit db17a92b91fc648dbd5523cfc2b08df19958f448
Author: Jihoon Son <[email protected]>
Date: 2016-02-25T15:19:55Z
Add PlanContext.
commit 00943976a1ba6172082f9b1f10e2875aec440145
Author: Jihoon Son <[email protected]>
Date: 2016-02-26T09:26:42Z
TAJO-2082
commit c1af7a88663d551d99da4081e8b02e7b99914861
Author: Jihoon Son <[email protected]>
Date: 2016-02-29T07:37:50Z
Refactoring stage
commit 3ee7bbbc464eaacd92b4c86fde62c9adb98ff32a
Author: Jihoon Son <[email protected]>
Date: 2016-03-01T05:32:48Z
Test finished.
commit 5ac73773f867bf234ec86c49adc116b99792b695
Author: Jihoon Son <[email protected]>
Date: 2016-03-01T05:40:47Z
fix comment
----
> Aggregation on a derived table which includes union can cause incorrect result
> ------------------------------------------------------------------------------
>
> Key: TAJO-2082
> URL: https://issues.apache.org/jira/browse/TAJO-2082
> Project: Tajo
> Issue Type: Bug
> Components: distributed query plan
> Reporter: Jihoon Son
> Assignee: Jihoon Son
> Priority: Blocker
> Fix For: 0.11.2
>
>
> This problem can be reproduced by running the following query on 10GB TPC-H
> data set.
> {noformat}
> select
> sum(t.cnt) as cnt, o_orderkey, o_custkey
> from
> (
> select
> o_orderkey, o_custkey, CAST(COUNT(1) AS INT4) as cnt
> from
> orders
> group by
> o_orderkey, o_custkey
> union all
> select
> o_orderkey, o_custkey, o_shippriority
> from
> orders
> ) as t
> group by
> o_orderkey, o_custkey
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)