Hi zhixin,
 Data may become not correct if use "distribute by rand()".
 https://issues.apache.org/jira/browse/KYLIN-3388




------------------ ???????? ------------------
??????: "liuzhixin"<liuz...@163.com>;
????????: 2018??11??2??(??????) ????12:53
??????: "dev"<dev@kylin.apache.org>;
????: "ShaoFeng Shi"<shaofeng...@apache.org>; 
????: Re: Redistribute intermediate table default not by rand()



Hi kylin team:

Step: Redistribute intermediate table
#
??????????????????????????????DISTRIBUTE BY????????????????DISTRIBUTE BY RAND()
????????????????????????????????????????????????????????????????????

Best Regards??

> ?? 2018??11??2????????12:03??liuzhixin <liuz...@163.com> ??????
> 
> Hi kylin team:
> 
> Version: Kylin2.5-hadoop3.1 for hdp3.0
> #
> Step: Redistribute intermediate table
> #
> DISTRIBUTE BY is that:
> INSERT OVERWRITE TABLE table_intermediate SELECT * FROM table_intermediate 
> DISTRIBUTE BY Field1, Field2, Field3;
> #
> Not DISTRIBUTE BY RAND()
> #
> Is this default DISTRIBUTE BY Field1, Field2, Field3? how to DISTRIBUTE BY 
> RAND()?
> 
> Best wishes.
>

Reply via email to