[
https://issues.apache.org/jira/browse/TAJO-727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13955082#comment-13955082
]
hyoungjunkim commented on TAJO-727:
-----------------------------------
LogicalOptimizer generates incorrect join key.
{noformat}
[Incoming]
[q_1396261715885_0001] 6 => 10 (type=HASH_SHUFFLE,
key=tpcds100.tmp1.avg_cs_ext_discount_amt (FLOAT8), num=32)
[q_1396261715885_0001] 9 => 10 (type=HASH_SHUFFLE,
key=tpcds100.cs1.cs_ext_discount_amt (FLOAT4), num=32)
[Outgoing]
[q_1396261715885_0001] 10 => 11 (type=HASH_SHUFFLE, key=, num=1)
GROUP_BY(33)()
=> exprs: (sum(tpcds100.cs1.cs_ext_discount_amt (FLOAT4)))
=> target list: ?sum_13 (FLOAT8)
=> out schema:{(1) ?sum_13 (FLOAT8)}
=> in schema:{(1) tpcds100.cs1.cs_ext_discount_amt (FLOAT4)}
JOIN(25)(INNER)
=> Join Cond: tpcds100.cs1.cs_ext_discount_amt (FLOAT4) >
tpcds100.tmp1.avg_cs_ext_discount_amt (FLOAT8)
=> target list: tpcds100.cs1.cs_ext_discount_amt (FLOAT4)
=> out schema: {(1) tpcds100.cs1.cs_ext_discount_amt (FLOAT4)}
=> in schema: {(2) tpcds100.tmp1.avg_cs_ext_discount_amt
(FLOAT8),tpcds100.cs1.cs_ext_discount_amt (FLOAT4)}
SCAN(32) on eb_1396261715885_0001_000009
=> out schema: {(1) tpcds100.cs1.cs_ext_discount_amt (FLOAT4)}
=> in schema: {(1) tpcds100.cs1.cs_ext_discount_amt (FLOAT4)}
SCAN(31) on eb_1396261715885_0001_000006
=> out schema: {(1) tpcds100.tmp1.avg_cs_ext_discount_amt (FLOAT8)}
=> in schema: {(1) tpcds100.tmp1.avg_cs_ext_discount_amt (FLOAT8)}
{noformat}
> TPC-DS Q32 returns no result.
> -----------------------------
>
> Key: TAJO-727
> URL: https://issues.apache.org/jira/browse/TAJO-727
> Project: Tajo
> Issue Type: Bug
> Reporter: hyoungjunkim
>
> I ran TPC-DS Q32 query.
> {code}
> SELECT sum(cs1.cs_ext_discount_amt) as excess_discount_amount
> FROM (SELECT cs.cs_item_sk as cs_item_sk,
> cs.cs_ext_discount_amt as cs_ext_discount_amt
> FROM catalog_sales cs
> JOIN date_dim d ON (d.d_date_sk = cs.cs_sold_date_sk)
> WHERE d.d_date between '2000-01-27' and '2000-04-27') cs1
> JOIN item i ON (i.i_item_sk = cs1.cs_item_sk)
> JOIN (SELECT cs2.cs_item_sk as cs_item_sk,
> 1.3 * avg(cs_ext_discount_amt) as
> avg_cs_ext_discount_amt
> FROM (SELECT cs.cs_item_sk as cs_item_sk,
> cs.cs_ext_discount_amt as
> cs_ext_discount_amt
> FROM catalog_sales cs
> JOIN date_dim d ON (d.d_date_sk = cs.cs_sold_date_sk)
> WHERE d.d_date between '2000-01-27' and '2000-04-27')
> cs2
> GROUP BY cs2.cs_item_sk) tmp1
> ON (i.i_item_sk = tmp1.cs_item_sk)
> WHERE i.i_manufact_id = 436 and
> cs1.cs_ext_discount_amt > tmp1.avg_cs_ext_discount_amt;
> {code}
> Tajo returns empty result.
> {noformat}
> result:
> hdfs://gruter102:6020/tmp/tajo-tajo/staging/q_1396256746100_0001/RESULT, 0
> rows (0 B)
> excess_discount_amount
> -------------------------------
> tpcds100>
> {noformat}
> This query is expected next result.
> {noformat}
> excess_discount_amount
> -------------------------------
> 8119270.359008789
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.2#6252)