Re: Spark tuning within Kylin? Article? Resource?

Jon Shoberg Mon, 17 Dec 2018 19:16:59 -0800

Greatly appreciate the response.

I started there but after OOM errors I started to work on the settings for
my test lab. After minimal success thought to ask if there was something
more in-depth for tuning with other Kylin users found successful.


Right now I've gone to very basic configuration with dynamic allocation and
see if I can avoid the late-stage OOM errors.

J

On Mon, Dec 17, 2018 at 7:44 PM JiaTao Tao <[email protected]> wrote:

> Hope this may help: http://kylin.apache.org/docs/tutorial/cube_spark.html
>
> Jon Shoberg <[email protected]> 于2018年12月18日周二 上午2:34写道：
>
>> Is there a good/favorite article for tuning spark settings within Kylin?
>>
>> I finally have Spark (2.1.3 as distributed with Kylin 2.5.2) running on
>> my systems.
>>
>> My small data set (35M records) runs well the default settings.
>>
>> My medium data set (4B records, 40GB compressed source file, 5 measures,
>> 6 dimensions with low carnality) often dies at Step 3 (Extract Fact Table
>> Distinct Columns) with out of memory errors.
>>
>> After using exceptionally large memory settings the job completed but I'm
>> trying to see if there is an optimization possible.
>>
>> Any suggestions or ideas?  I've searched/read on spark tuning in general
>> but otherwise feeling I'm not making too much progress on optimizing with
>> the settings I've tried.
>>
>> Thanks!J
>>
>
>
> --
>
>
> Regards!
>
> Aron Tao
>

Re: Spark tuning within Kylin? Article? Resource?

Reply via email to