[ 
https://issues.apache.org/jira/browse/KYLIN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15574410#comment-15574410
 ] 

liyang commented on KYLIN-2088:
-------------------------------

Right now, the MeasureType extension point represents a combination of 
aggregation function and data type. If follow this design, then two MeasureType 
need to be declared -- COUNT_DISTINCT(BITMAP) and INTERSECT_COUNT(BITMAP).

Also the current UDF is a little duplicated with MeasureType, I may combine 
them into one.

> Support intersect count for calculation of retention or conversion rates
> ------------------------------------------------------------------------
>
>                 Key: KYLIN-2088
>                 URL: https://issues.apache.org/jira/browse/KYLIN-2088
>             Project: Kylin
>          Issue Type: New Feature
>          Components: Query Engine
>            Reporter: Yerui Sun
>            Assignee: Yerui Sun
>         Attachments: KYLIN-2088.patch
>
>
> Retention or Conversion Rates is very important in data analyze. 
> It can be calculated from two dataset of two different value of one 
> dimension. For example, we have an count distinct measure, like uv(dataset of 
> uuid), and one dimension, like date, and the retention of uv between 
> '20161015' and '20161016' is the intersection of two uv datasets.
> Fortunately, we have implement dataset in Kylin, as bitmap, for precisely 
> count distinct. Only an UDAF is needed to calculate intersection of two or 
> more bitmaps.
> I'll try on this and post patch later.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to