Hi, shaofeng: Thank you for your suggestion! I'll give it a try. -- Best regards,
Xi Chen -----Original Message----- From: ShaoFeng Shi <shaofeng...@apache.org> Sent: Monday, November 5, 2018 4:06 PM To: dev <dev@kylin.apache.org> Subject: Re: How to increase split number for Fact distinct columns when using spark engine?(picture added) Hi Xi, The core is the same; By default, MR and Spark run a container/task for one file block. If we can split the data to more splits, will get more concurrent tasks: kylin.engine.mr.mapper-input-rows=500000 陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> 于2018年11月5日周一 下午3:52写道: > Hi, shaofeng: > Thank you for your reply. > I've checked the doc, it does not contain tuning for spark engine. > > -- > Best regards, > > Xi Chen > > -----Original Message----- > From: ShaoFeng Shi <shaofeng...@apache.org> > Sent: Monday, November 5, 2018 3:30 PM > To: dev <dev@kylin.apache.org> > Subject: Re: How to increase split number for Fact distinct columns > when using spark engine?(picture added) > > Please check this doc: > https://kylin.apache.org/docs/howto/howto_optimize_build.html > > 陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> 于2018年11月5日周一 下午3:25写道: > > > Hi: > > > > I’m sorry the picture is dead again. > > > > I upload it as attachment this time > > > > > > > > -- > > > > Best regards, > > > > > > > > Xi Chen > > > > > > > > > > > > *From:* 陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> > > *Sent:* Monday, November 5, 2018 3:04 PM > > *To:* dev@kylin.apache.org > > *Subject:* How to increase split number for Fact distinct columns > > when using spark engine?(picture added) > > > > > > > > Hi, ALL: > > > > I’m using spark engine to build cube. > > > > Now I found the bottleneck of build time lies in the #3 Step Name: > > Extract Fact Table Distinct Columns. > > > > When I look into the spark application, I found there is only two > > splits regardless of how large the input sequence file is. > > > > I wonder how to increase the number of split for this step? > > > > I’m new to spark and any help will be great thanks! > > > > > > > > P.S. Spark job of #3 Step Name: Extract Fact Table Distinct Columns. > > > > -- > > > > Best regards, > > > > > > > > Xi Chen > > > > > > > > > > > > > -- > Best regards, > > Shaofeng Shi 史少锋 > -- Best regards, Shaofeng Shi 史少锋