[jira] [Commented] (KYLIN-4967) Forbid to set 'spark.sql.adaptive.enabled' to true when building cube with Spark 2.X

ASF subversion and git services (Jira) Sun, 11 Apr 2021 20:23:10 -0700


    [ 
https://issues.apache.org/jira/browse/KYLIN-4967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17318993#comment-17318993
 ]


ASF subversion and git services commented on KYLIN-4967:
--------------------------------------------------------

Commit 6153889ce1b1fad2c409364e77ec471166654551 in kylin's branch 
refs/heads/kylin-on-parquet-v2 from Zhichao Zhang
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=6153889 ]

KYLIN-4967 Forbid to set 'spark.sql.adaptive.enabled' to true when building 
cube with Spark 2.X

With spark 2.X, when set 'spark.sql.adaptive.enabled' to true, it will impact 
the actually partition count when doing repartition with spark, which will lead 
to the wrong results for global dict and repartition by shardby column.

For example, after writing a cuboid data, kylin will repartition the cuboid 
data with 3 partition if need, but if 'spark.sql.adaptive.enabled' is true, 
spark will optimize the partition num to 1, which leads to wrong.


> Forbid to set 'spark.sql.adaptive.enabled' to true when building cube with 
> Spark 2.X
> ------------------------------------------------------------------------------------
>
>                 Key: KYLIN-4967
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4967
>             Project: Kylin
>          Issue Type: Bug
>    Affects Versions: v4.0.0-beta
>            Reporter: Zhichao  Zhang
>            Assignee: Zhichao  Zhang
>            Priority: Minor
>             Fix For: v4.0.0-GA
>
>
> With spark 2.X, when set 'spark.sql.adaptive.enabled' to true, it will impact 
> the actually partition count when doing repartition with spark, which will 
> lead to the wrong results for global dict and repartition by shardby column.
> For example, after writing a cuboid data, kylin will repartition the cuboid 
> data with 3 partition if need, but if 'spark.sql.adaptive.enabled' is true, 
> spark will optimize the partition num to 1, which leads to wrong.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (KYLIN-4967) Forbid to set 'spark.sql.adaptive.enabled' to true when building cube with Spark 2.X

Reply via email to