Hi, shaofeng:
        Thank you for your suggestion! I'll give it a try.

--
Best regards,

Xi Chen



-----Original Message-----
From: ShaoFeng Shi <shaofeng...@apache.org> 
Sent: Monday, November 5, 2018 4:06 PM
To: dev <dev@kylin.apache.org>
Subject: Re: How to increase split number for Fact distinct columns when using 
spark engine?(picture added)

Hi Xi,

The core is the same; By default, MR and Spark run a container/task for one 
file block. If we can split the data to more splits, will get more concurrent 
tasks:

kylin.engine.mr.mapper-input-rows=500000

陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> 于2018年11月5日周一 下午3:52写道:

> Hi, shaofeng:
>         Thank you for your reply.
>         I've checked the doc, it does not contain tuning for spark engine.
>
> --
> Best regards,
>
> Xi Chen
>
> -----Original Message-----
> From: ShaoFeng Shi <shaofeng...@apache.org>
> Sent: Monday, November 5, 2018 3:30 PM
> To: dev <dev@kylin.apache.org>
> Subject: Re: How to increase split number for Fact distinct columns 
> when using spark engine?(picture added)
>
> Please check this doc:
> https://kylin.apache.org/docs/howto/howto_optimize_build.html
>
> 陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> 于2018年11月5日周一 下午3:25写道:
>
> > Hi:
> >
> >        I’m sorry the picture is dead again.
> >
> >        I upload it as attachment this time
> >
> >
> >
> > --
> >
> > Best regards,
> >
> >
> >
> > Xi Chen
> >
> >
> >
> >
> >
> > *From:* 陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com>
> > *Sent:* Monday, November 5, 2018 3:04 PM
> > *To:* dev@kylin.apache.org
> > *Subject:* How to increase split number for Fact distinct columns 
> > when using spark engine?(picture added)
> >
> >
> >
> > Hi, ALL:
> >
> >        I’m using spark engine to build cube.
> >
> > Now I found the bottleneck of build time lies in the #3 Step Name:
> > Extract Fact Table Distinct Columns.
> >
> > When I look into the spark application, I found there is only two 
> > splits regardless of how large the input sequence file is.
> >
> > I wonder how to increase the number of split for this step?
> >
> > I’m new to spark and any help will be great thanks!
> >
> >
> >
> > P.S. Spark job of #3 Step Name: Extract Fact Table Distinct Columns.
> >
> > --
> >
> > Best regards,
> >
> >
> >
> > Xi Chen
> >
> >
> >
> >
> >
>
>
> --
> Best regards,
>
> Shaofeng Shi 史少锋
>


--
Best regards,

Shaofeng Shi 史少锋

Reply via email to