[ 
https://issues.apache.org/jira/browse/KYLIN-2800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Billy Liu reassigned KYLIN-2800:
--------------------------------

    Assignee: zhengdong

> All dictionaries should be built based on the flat hive table
> -------------------------------------------------------------
>
>                 Key: KYLIN-2800
>                 URL: https://issues.apache.org/jira/browse/KYLIN-2800
>             Project: Kylin
>          Issue Type: Bug
>            Reporter: zhengdong
>            Assignee: zhengdong
>         Attachments: 
> 0001-KYLIN-2800-All-dictionaries-should-be-built-based-on.patch
>
>
> After KYLIN-2457, we still got wrong query result sometimes after a merging 
> job finished. 
> Finally, we realize the root cause is that we always use lookup table as 
> source data to build dictionaries for FK columns. 
> However, incremental lookup table doesn't mean sequential and incremental PK. 
> If a new record inserted into the lookup table while its PK column does not 
> have the max value, ID numbers in the new dictionary could be changed for 
> those PK value larger than the newest one. What's more, using lookup table as 
> source data for FK column's dictionary may has performance advantage for 
> merging job, but also may encounter too big dictionary problem for large 
> lookup tables. And we must add some validation rules to ensure the PK value 
> sequential and incremental.
> On the another hand, we could just unify using the flat hive table as data 
> source for all dictionaries.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to