[ 
https://issues.apache.org/jira/browse/KYLIN-4818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17244447#comment-17244447
 ] 

ASF GitHub Bot commented on KYLIN-4818:
---------------------------------------

hit-lacus edited a comment on pull request #1485:
URL: https://github.com/apache/kylin/pull/1485#issuecomment-738777923


   ## Cuboid size need to be rewrote
   
   Source Data | Path         | Size 
   --                   | --             | --
   part_dt=2020-10-11 | /Lacus/data/UserActionStream/part_dt=2020-10-11 |  72.8 
MB
   part_dt=2020-10-12 | /Lacus/data/UserActioStream/part_dt=2020-10-12 |  72.8 
MB
   part_dt=2020-10-13 | /Lacus/data/UserActionStream/part_dt=2020-10-13 | 439.2 
M
   
   
   Name | Value
   -- | --
   Source Data                       |  145 MB
   Segment Data                    |  221.45 MB
   Real Expansion Rate         | 152 %
   Expected Expansion Rate | 1500 %
   
   
   Real expansion rate is far different from expected expansion rate .
   
   I should pay attention to `CubeStatsReader#getCuboidSizeMapFromRowCount` .  
Cuboid size estimation should re-invent ?
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


> Calculate cuboid statistics in Kylin 4
> --------------------------------------
>
>                 Key: KYLIN-4818
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4818
>             Project: Kylin
>          Issue Type: Sub-task
>          Components: Spark Engine
>            Reporter: Xiaoxiang Yu
>            Assignee: Xiaoxiang Yu
>            Priority: Major
>             Fix For: v4.0.0-beta
>
>
> Refer to SparkFactDistinct.java in Kylin 3, I will try to use spark to 
> calculate(estimate) rowcount/size for cuboid candidate. Rowcount/size of 
> cuboid si the input for cubeplanner phase one and phase two.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to