[
https://issues.apache.org/jira/browse/KYLIN-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17089387#comment-17089387
]
ASF subversion and git services commented on KYLIN-4185:
--------------------------------------------------------
Commit 322b662b2473ce2464dc454a650c33d472a9cb98 in kylin's branch
refs/heads/document from Kang
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=322b662 ]
add doc for KYLIN-4185 (#1071)
* add doc for KYLIN-4185
* update in _docs31
* KYLIN-4185 Fix some description of config
Co-authored-by: nichunen <[email protected]>
> CubeStatsReader estimate wrong cube size
> ----------------------------------------
>
> Key: KYLIN-4185
> URL: https://issues.apache.org/jira/browse/KYLIN-4185
> Project: Kylin
> Issue Type: Improvement
> Reporter: ZhouKang
> Assignee: ZhouKang
> Priority: Major
> Fix For: v3.1.0
>
>
> CubeStatsReader estimate wrong cube size, which cause a lot of problems.
> when the estimated size is much larger than the real size, the spark
> application's executor number is small, and cube build step will take a long
> time. sometime the step will failed due to the large dataset.
> When the estimated size is much smaller than the real size. the cuboid file
> in HDFS is small, and there are much of cuboid file.
>
> In our production environment, both the two situation happened.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)