[ 
https://issues.apache.org/jira/browse/KYLIN-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16640322#comment-16640322
 ] 

Ruslan Dautkhanov commented on KYLIN-2353:
------------------------------------------

[~kangkaisen] thank you for this great improvement.
Would you recommend BitmapCounter for high-cardinality columns?
I assume it will be very fast for low-cardinality columns like `product type`,
but would it also work on high-cardinality columns? For example, if the number
of distinct values in the column `household_id` is *1 billion*, would
BitmapCounter, and Kylin in general, handle `count(distinct household_id)` well?

> Serialize BitmapCounter with distinct count
> -------------------------------------------
>
>                 Key: KYLIN-2353
>                 URL: https://issues.apache.org/jira/browse/KYLIN-2353
>             Project: Kylin
>          Issue Type: Improvement
>          Components: Metadata
>    Affects Versions: v1.6.0
>            Reporter: kangkaisen
>            Assignee: kangkaisen
>            Priority: Major
>             Fix For: v2.0.0
>
>         Attachments: KYLIN-2353.patch
>
>
> Currently, we deserialize the bitmap whether or not we need to aggregate.
> Instead, we could serialize {{BitmapCounter}} together with its distinct
> count, read only that count when deserializing, and delay deserializing the
> bitmap itself until we actually need to aggregate.
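
The idea in the description above can be sketched roughly as follows. This is
only an illustration, not the code from KYLIN-2353.patch: the class name
LazyBitmapCounter, its methods, and the byte layout (8-byte count, 4-byte
payload length, payload) are all assumptions, and java.util.BitSet stands in
for whatever bitmap implementation Kylin actually uses.

```java
import java.nio.ByteBuffer;
import java.util.BitSet;

// Hypothetical sketch: the distinct count is stored ahead of the bitmap bytes,
// so reading the count never requires decoding the bitmap.
final class LazyBitmapCounter {
    private final long storedCount; // count read eagerly from the header
    private final byte[] payload;   // raw bitmap bytes, kept undecoded
    private BitSet bitmap;          // decoded only when aggregation needs it

    private LazyBitmapCounter(long storedCount, byte[] payload) {
        this.storedCount = storedCount;
        this.payload = payload;
    }

    // Assumed layout: [count: long][payload length: int][payload bytes]
    static byte[] serialize(BitSet bits) {
        byte[] body = bits.toByteArray();
        ByteBuffer out = ByteBuffer.allocate(Long.BYTES + Integer.BYTES + body.length);
        out.putLong(bits.cardinality());
        out.putInt(body.length);
        out.put(body);
        return out.array();
    }

    // Reads the count and copies the payload bytes, but does not decode them.
    static LazyBitmapCounter deserialize(ByteBuffer in) {
        long count = in.getLong();
        byte[] body = new byte[in.getInt()];
        in.get(body);
        return new LazyBitmapCounter(count, body);
    }

    // Uses the stored count until the bitmap has been decoded (and possibly merged).
    long getCount() {
        return bitmap != null ? bitmap.cardinality() : storedCount;
    }

    boolean isDecoded() {
        return bitmap != null;
    }

    // Decodes the bitmap on first use.
    private BitSet bits() {
        if (bitmap == null) {
            bitmap = BitSet.valueOf(payload);
        }
        return bitmap;
    }

    // Aggregation is the only operation that forces decoding, on both sides.
    void orWith(LazyBitmapCounter other) {
        bits().or(other.bits());
    }
}
```

With this shape, a query that only reads a stored count(distinct ...) value
calls getCount() and never pays the decoding cost. A query that has to merge
several counters calls orWith(), which decodes the bitmaps at that point.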



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
