[
https://issues.apache.org/jira/browse/KYLIN-4366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17028744#comment-17028744
]
ASF GitHub Bot commented on KYLIN-4366:
---------------------------------------
wangxiaojing123 commented on pull request #1093: KYLIN-4366 Build Global Dict
by MR/Hive, Merge to dict table Step implementation
URL: https://github.com/apache/kylin/pull/1093
issue link: https://issues.apache.org/jira/browse/KYLIN-4366
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
> Build Global Dict by MR/Hive, Merge to dict table Step implementation
> ---------------------------------------------------------------------
>
> Key: KYLIN-4366
> URL: https://issues.apache.org/jira/browse/KYLIN-4366
> Project: Kylin
> Issue Type: Sub-task
> Reporter: wangxiaojing
> Assignee: wangxiaojing
> Priority: Major
>
> Merge the global dict values of each column, calculated by the previous
> parallel total build step, into the global dictionary table (default
> globalDictDatabase.cubeName_global_dict) of this cube.
> HQL example below:
> {code:java}
> -- Add the partition if it does not exist
> ALTER TABLE
> kylin_intermediate_kylin_sales_cube_mr_6222c210_ce2d_e8ce_dd0f_f12c38fa9115_global_dict
> ADD IF NOT EXISTS PARTITION (dict_column='KYLIN_SALES_SELLER_ID');
> ALTER TABLE
> kylin_intermediate_kylin_sales_cube_mr_6222c210_ce2d_e8ce_dd0f_f12c38fa9115_global_dict
> ADD IF NOT EXISTS PARTITION (dict_column='KYLIN_SALES_BUYER_ID');
> -- Merge the data, one dictionary column at a time
> INSERT OVERWRITE TABLE db.kylin_sales_cube_mr_global_dict PARTITION
> (dict_column = 'KYLIN_SALES_SELLER_ID')
> SELECT dict_key, dict_val FROM db.kylin_sales_cube_mr_global_dict WHERE
> dict_column = 'KYLIN_SALES_SELLER_ID'
> UNION
> SELECT dict_key, dict_val FROM
> kylin_intermediate_kylin_sales_cube_mr_6222c210_ce2d_e8ce_dd0f_f12c38fa9115_global_dict
> WHERE dict_column = 'KYLIN_SALES_SELLER_ID' ;
> INSERT OVERWRITE TABLE db.kylin_sales_cube_mr_global_dict PARTITION
> (dict_column = 'KYLIN_SALES_BUYER_ID')
> SELECT dict_key, dict_val FROM db.kylin_sales_cube_mr_global_dict WHERE
> dict_column = 'KYLIN_SALES_BUYER_ID'
> UNION
> SELECT dict_key, dict_val FROM
> kylin_intermediate_kylin_sales_cube_mr_6222c210_ce2d_e8ce_dd0f_f12c38fa9115_global_dict
> WHERE dict_column = 'KYLIN_SALES_BUYER_ID' ;
> {code}
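The merge statements quoted above repeat the same pattern for every dictionary column, so they can be generated from a column list. The following is only a minimal sketch of that idea, not Kylin's actual implementation: the function name `merge_statements` and its parameters are hypothetical, and the table names in the usage lines are placeholders.

```python
import re


def merge_statements(dict_table, intermediate_table, columns):
    """Build one INSERT OVERWRITE ... UNION statement per dictionary column.

    Hypothetical helper for illustration only; it mirrors the HQL pattern
    in the issue description and is not taken from the Kylin code base.
    """
    statements = []
    for col in columns:
        # The column name is embedded in a string literal, so accept only
        # plain identifier characters.
        if not re.fullmatch(r"[A-Za-z0-9_]+", col):
            raise ValueError(f"unexpected column name: {col!r}")
        statements.append(
            f"INSERT OVERWRITE TABLE {dict_table} PARTITION (dict_column = '{col}')\n"
            f"SELECT dict_key, dict_val FROM {dict_table} WHERE dict_column = '{col}'\n"
            f"UNION\n"
            f"SELECT dict_key, dict_val FROM {intermediate_table} WHERE dict_column = '{col}';"
        )
    return statements


# Placeholder table names, chosen for this example only.
example = merge_statements(
    "db.kylin_sales_cube_mr_global_dict",
    "kylin_intermediate_example_global_dict",
    ["KYLIN_SALES_SELLER_ID", "KYLIN_SALES_BUYER_ID"],
)
print("\n\n".join(example))
```

UNION (rather than UNION ALL) removes duplicate (dict_key, dict_val) rows, so entries already present in the dictionary table are not written twice when the partition is overwritten.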
--
This message was sent by Atlassian Jira
(v8.3.4#803005)