Re: Re: [DISCUSS] New Kylin Streaming Solution From eBay

2018-11-01 Thread ShaoFeng Shi
Hi Gang, Thank you for the information, that is helpful for understanding the overall design and implementation. Do you have some statistical information, like performance, throughput, stability, etc.? Besides, what's the plan of contributing it to the community? Thanks! Ma Gang 于2018年11月1日周四

Re: [DISCUSS] New Kylin Streaming Solution From eBay

2018-11-01 Thread Xiaoxiang Yu
Thank you for your reply. Maybe I can help to improve your Kylin Streaming Solution in the future. Best wishes, Xiaoxiang Yu On [DATE], "[NAME]" <[ADDRESS]> wrote: Thanks Xiaoxiang, Very good questions! Please see my comments started with [Gang]: 1.

Re:Re: Re: [DISCUSS] New Kylin Streaming Solution From eBay

2018-11-01 Thread Ma Gang
Hi ShaoFeng, For streaming ingest/query performance, there is a doc: https://drive.google.com/file/d/1GSBMpRuVQRmr8Ev2BWvssfMd-Rck9vsH/view?ths=true , it is also in the design doc's 'performance' section attached in the jira: https://issues.apache.org/jira/browse/KYLIN-3654 For stability, it is

[jira] [Created] (KYLIN-3662) exception message "Cannot find project '%s'." should be formated

2018-11-01 Thread Lingang Deng (JIRA)
Lingang Deng created KYLIN-3662: --- Summary: exception message "Cannot find project '%s'." should be formated Key: KYLIN-3662 URL: https://issues.apache.org/jira/browse/KYLIN-3662 Project: Kylin

Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi kylin team: Version: Kylin2.5-hadoop3.1 for hdp3.0 # Step: Redistribute intermediate table # DISTRIBUTE BY is that: INSERT OVERWRITE TABLE table_intermediate SELECT * FROM table_intermediate DISTRIBUTE BY Field1, Field2, Field3; # Not DISTRIBUTE BY RAND() # Is this default DISTRIBUTE BY Field1

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi kylin team: Step: Redistribute intermediate table # 默认选择了维度的前三个字段作为DISTRIBUTE BY的依据,没有采用DISTRIBUTE BY RAND() 如果没有合适的维度字段,这样的默认策略将会导致数据更加的数据不均衡。 Best Regards! > 在 2018年11月2日,下午12:03,liuzhixin 写道: > > Hi kylin team: > > Version: Kylin2.5-hadoop3.1 for hdp3.0 > # > Step: Redistribute intermed

?????? Redistribute intermediate table default not by rand()

2018-11-01 Thread Chao Long
Hi zhixin, Data may become not correct if use "distribute by rand()". https://issues.apache.org/jira/browse/KYLIN-3388 -- -- ??: "liuzhixin"; : 2018??11??2??(??) 12:53 ??: "dev"; : "ShaoFeng Shi"; : Re: Redistribute int

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread ShaoFeng Shi
Please move the high cardinality dimensions to the leading position of rowkey, that will make the data distribution more even; Chao Long 于2018年11月2日周五 下午1:38写道: > Hi zhixin, > Data may become not correct if use "distribute by rand()". > https://issues.apache.org/jira/browse/KYLIN-3388 > > > >

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi ShaoFeng Shi OK, thank you for the answer. # Perhaps Kylin should provide the tips or notes for the default shard. Best Wishes. > 在 2018年11月2日,下午1:42,ShaoFeng Shi 写道: > > Please move the high cardinality dimensions to the leading position of > rowkey, that will make the data distribution mo

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi Chao Long: Thank you for the answer. # Maybe kylin should provide config for every build step Best wishes. > 在 2018年11月2日,下午1:38,Chao Long 写道: > > Hi zhixin, > Data may become not correct if use "distribute by rand()". > https://issues.apache.org/jira/browse/KYLIN-3388 > > > > > ---

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread ShaoFeng Shi
Hi Zhixin, Kylin 2.5.1 will add some tips in the advanced step, hope that can help. liuzhixin 于2018年11月2日周五 下午2:05写道: > Hi Chao Long: > > Thank you for the answer. > # > Maybe kylin should provide config for every build step > > Best wishes. > > > 在 2018年11月2日,下午1:38,Chao Long 写道: > > > > Hi z

[VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-01 Thread ShaoFeng Shi
Hi all, I have created a build for Apache Kylin 2.5.1, release candidate 1. Changes highlights: [KYLIN-3531] - Login failed with case-insensitive username [KYLIN-3604] - Can't build cube with spark in HBase standalone mode [KYLIN-3613] - Kylin with Standalone HBase Cluster could not find the mai

??????[VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-01 Thread Chao Long
+1 -- -- ??: "ShaoFeng Shi"; : 2018??11??2??(??) 2:09 ??: "dev"; : [VOTE] Release apache-kylin-2.5.1 (RC1) Hi all, I have created a build for Apache Kylin 2.5.1, release candidate 1. Changes highlights: [KYLIN-3531] - L

Re: 回复:[VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-01 Thread zhan shaoxiong
+1 On [DATE], "[NAME]" <[ADDRESS]> wrote: +1 -- 原始邮件 -- 发件人: "ShaoFeng Shi"; 发送时间: 2018年11月2日(星期五) 下午2:09 收件人: "dev"; 主题: [VOTE] Release apache-kylin-2.5.1 (RC1) Hi all, I have created a

Re: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-01 Thread zhan shaoxiong
+1 On [DATE], "[NAME]" <[ADDRESS]> wrote: Hi all, I have created a build for Apache Kylin 2.5.1, release candidate 1. Changes highlights: [KYLIN-3531] - Login failed with case-insensitive username [KYLIN-3604] - Can't build cube with spark in HBase standalone m

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi ShaoFeng Shi, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Kylin can insert one rand column for the next shard, (as default). At the same time, Kylin should support the custom column for shard. Best Wishes. > 在 2018年

Re: Redistribute intermediate table default not by rand()

2018-11-01 Thread liuzhixin
Hi ShaoFeng Shi, Thank you for the answer. # Step1: Create Intermediate Flat Hive Table Step2: Redistribute intermediate table # Perhaps, Kylin can insert one rand column for the next shard, (as default). At the same time, Kylin should support the custom column for shard. Best Wishes. > 在 2018年

[jira] [Created] (KYLIN-3663) Failed to delete project when project has more than one table

2018-11-01 Thread rongchuan.jin (JIRA)
rongchuan.jin created KYLIN-3663: Summary: Failed to delete project when project has more than one table Key: KYLIN-3663 URL: https://issues.apache.org/jira/browse/KYLIN-3663 Project: Kylin

Re: [VOTE] Release apache-kylin-2.5.1 (RC1)

2018-11-01 Thread JiaTao Tao
👏👏👏 Here is my vote: +1 (binding) ShaoFeng Shi 于2018年11月2日周五 下午2:10写道: > Hi all, > > I have created a build for Apache Kylin 2.5.1, release candidate 1. > > Changes highlights: > > [KYLIN-3531] - Login failed with case-insensitive username > [KYLIN-3604] - Can't build cube with spark in HBase s