[jira] [Comment Edited] (KYLIN-4457) Query cube result doesn't math with spark sql

wangrupeng (Jira) Thu, 23 Apr 2020 00:16:13 -0700


    [ 
https://issues.apache.org/jira/browse/KYLIN-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17090333#comment-17090333
 ]


wangrupeng edited comment on KYLIN-4457 at 4/23/20, 7:15 AM:
-------------------------------------------------------------

It's not a bug. This problem happened because that when build segment with 
range(2011-01-01 00:00:00, 2013-01-01 00:00:00), kylin will not process the 
data in the date of '2013-01-01'. And spark will include the data of 
'2013-01-01'. We can verify correctness of cube by using the following sql.

select count * FROM TEST_KYLIN_FACT LEFT JOIN TEST_ORDER ON 
TEST_KYLIN_FACT.ORDER_ID = TEST_ORDER.ORDER_ID where CAL_DT > cast('2011-01-01' 
as date) and CAL_DT < cast('2013-01-01' as date)


was (Author: wangrupeng):
It's not a bug. This problem happened because that when build segment with 
range(2011-01-01 00:00:00, 2013-01-01 00:00:00), kylin will not process the 
data in the date of '2013-01-01'. And spark will include the data of 
'2013-01-01'. We can verify correctness of cube by using the following sql.

select count(*) FROM TEST_KYLIN_FACT LEFT JOIN TEST_ORDER ON 
TEST_KYLIN_FACT.ORDER_ID = TEST_ORDER.ORDER_ID where CAL_DT > cast('2011-01-01' 
as date) and CAL_DT < cast('2013-01-01' as date)

> Query cube result doesn't math with spark sql
> ---------------------------------------------
>
>                 Key: KYLIN-4457
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4457
>             Project: Kylin
>          Issue Type: Bug
>          Components: Query Engine, Storage - Parquet
>            Reporter: xuekaiqi
>            Assignee: wangrupeng
>            Priority: Major
>             Fix For: v4.0.0-beta
>
>   Original Estimate: 16h
>  Remaining Estimate: 16h
>
> Source: local CSV
> Result:
> ||cube||spark||
> |5010|5030|
> SQL:
> {code:java}
> select count(*)  FROM TEST_KYLIN_FACT LEFT JOIN TEST_ORDER ON 
> TEST_KYLIN_FACT.ORDER_ID = TEST_ORDER.ORDER_ID where CAL_DT > '2011-01-01 
> 00:00:00' and CAL_DT < '2013-01-01 00:00:00'
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Comment Edited] (KYLIN-4457) Query cube result doesn't math with spark sql

Reply via email to