[ 
https://issues.apache.org/jira/browse/KYLIN-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17038184#comment-17038184
 ] 

ASF subversion and git services commented on KYLIN-4185:
--------------------------------------------------------

Commit a979a49932ef0b4bcd48d518559f6df6840e6a22 in kylin's branch 
refs/heads/master from Zhou Kang
[ https://gitbox.apache.org/repos/asf?p=kylin.git;h=a979a49 ]

KYLIN-4185: optimize CuboidSizeMap by using historical segments


> CubeStatsReader estimate wrong cube size
> ----------------------------------------
>
>                 Key: KYLIN-4185
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4185
>             Project: Kylin
>          Issue Type: Improvement
>            Reporter: ZhouKang
>            Assignee: ZhouKang
>            Priority: Major
>
> CubeStatsReader estimate wrong cube size, which cause a lot of problems.
> when the estimated size is much larger than the real size, the spark 
> application's executor number is small, and cube build step will take a long 
> time. sometime the step will failed due to the large dataset.
> When the estimated size is much smaller than the real size. the cuboid file 
> in HDFS is small, and there are much of cuboid file.
>  
> In our production environment, both the two situation happened.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to