Thanks ShaoFeng,I will check the code base to understand how these parameters 
are effecting the cube estimations, that will help us gauge most appropriate 
values for the same. (Till we plan to upgrade 2.5.1)I have a question though, 
If we chabge these parameters for the already existing cubes and refresh the 
already existing segments, that will create the new segment for the required 
date range, with preferred region size or ?Maybe after that we can run the 
kylin cleanup to delete these tables from Hbase.Other option is to manually 
merge , the regions of this existing segment table into our preferred region 
numbers. Hope Kylin will update its metadata accordingly in this case, without 
any issues or ?Thanks,[email protected] Sent from my Samsung Galaxy 
smartphone.
-------- Original message --------From: ShaoFeng Shi <[email protected]> 
Date: 06/11/2018  8:36 am  (GMT+05:30) To: dev <[email protected]> Subject: 
Re: Understanding about region cut size for base Hi Ketan,Kylin estimates the 
HBase table size; The estimation might be inaccuratewhen there are some 
advanced measures like TopN, Count distinct. Theaccuracy was improved in v2.5.0 
by KYLIN-3453. For previous versions, youmay need to manually give smaller 
value to these 
parameters:kylin.cube.size-estimate-ratio=0.25kylin.cube.size-estimate-memhungry-ratio=0.05ketan
 dikshit <[email protected]> 于2018年11月5日周一 下午10:13写道:> Hi Team> I 
would like to understand how does the> 'kylin.storage.hbase.region-cut-gb’ 
property works.> We are currently using kylin 2.3.1, We are going with the 
default property> value ie; kylin.storage.hbase.region-cut-gb=5>> But still we 
see some segments not adhering to this property; example:>> Segment: 
20180723000000_20180730000000>> Start Time: 2018-07-23 00:00:00> End Time: 
2018-07-30 00:00:00> Source Count: 447860691> HBase Table: KYLIN_ENX1MBQAMX> 
Region Count: 500> Size: 49.57422 GB> Segment: 20181005000000_20181006000000>> 
Start Time: 2018-10-05 00:00:00> End Time: 2018-10-06 00:00:00> Source Count: 
52522716> HBase Table: KYLIN_PG5PQBJ910> Region Count: 47> Size: 6.16309 GB> 
Segment: 20181010000000_20181011000000>> Start Time: 2018-10-10 00:00:00> End 
Time: 2018-10-11 00:00:00> Source Count: 62012099> HBase Table: 
KYLIN_I4QS9A4AHL> Region Count: 52> Size: 6.98145 GB>> Along with the same, we 
are also using compression,> 'kylin.storage.hbase.compression-codec=lz4’> The 
number of regions need to be kept in control, for our Hbase cluster to> be 
performant.>> Please share the understanding, how this property works, and what 
can be> the possible reasons why it is not working as intended.>> Thanks,> 
Ketan@Exponential>>-- Best regards,Shaofeng Shi 史少锋

Reply via email to