[ https://issues.apache.org/jira/browse/KYLIN-4345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
wangxiaojing updated KYLIN-4345:
--------------------------------
Description:
This step builds a temporary dictionary encoding for each global dict
column in parallel through MR and stores the result in HDFS.
* Example path in HDFS:
/user/kylin/globaldic_test/kylin-188c9f9d_dabb_944e_9f20_99dc95be66e6/kylin_sales_cube_mr/dict_column=KYLIN_SALES_SELLER_ID/part_sort
* Content: the temporary dict value followed by the original value, for example:
1 200002222
2 100001
3 300001
was:
This step builds a temporary dictionary encoding for each global dict
column in parallel through MR and stores the result in HDFS.
* Example path in HDFS:
/user/kylin/tmp/kylin/globaldic_test/kylin-188c9f9d_dabb_944e_9f20_99dc95be66e6/kylin_sales_cube_mr/dict_column=KYLIN_SALES_SELLER_ID/part_sort
* Content: the temporary dict value followed by the original value, for example:
1 200002222
2 100001
3 300001
> Build Global Dict by MR/Hive, Parallel Part Build Step implementation
> ---------------------------------------------------------------------
>
> Key: KYLIN-4345
> URL: https://issues.apache.org/jira/browse/KYLIN-4345
> Project: Kylin
> Issue Type: Sub-task
> Reporter: wangxiaojing
> Assignee: wangxiaojing
> Priority: Major
>
> This step builds a temporary dictionary encoding for each global dict
> column in parallel through MR and stores the result in HDFS.
> * Example path in HDFS:
>
> /user/kylin/globaldic_test/kylin-188c9f9d_dabb_944e_9f20_99dc95be66e6/kylin_sales_cube_mr/dict_column=KYLIN_SALES_SELLER_ID/part_sort
> * Content: the temporary dict value followed by the original value, for example:
> 1 200002222
> 2 100001
> 3 300001
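
For readers who want to inspect a part_sort file, the layout described above can be read back with a short script. This is only a sketch, not code from Kylin: the function names parse_part_sort and dict_column_from_path are hypothetical, and it assumes one whitespace-separated pair per line, which may not match the delimiter the MR job actually writes.

```python
from typing import Dict, Iterable


def parse_part_sort(lines: Iterable[str]) -> Dict[str, int]:
    """Map each original value to its temporary dict value.

    Assumes one "<temporary dict value> <original value>" pair per line,
    separated by whitespace; the real delimiter in part_sort may differ.
    """
    mapping: Dict[str, int] = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        temp_id, original = line.split(maxsplit=1)
        mapping[original] = int(temp_id)
    return mapping


def dict_column_from_path(path: str) -> str:
    """Return the column name from the 'dict_column=<name>' path segment."""
    prefix = "dict_column="
    for segment in path.split("/"):
        if segment.startswith(prefix):
            return segment[len(prefix):]
    raise ValueError("no dict_column=<name> segment in path")


# Sample rows copied from the issue description.
sample = ["1 200002222", "2 100001", "3 300001"]
encoded = parse_part_sort(sample)
print(encoded["100001"])  # prints 2
```

The original values are kept as strings so that the sketch makes no assumption about the column's type; only the temporary dict value is treated as an integer.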