[ https://issues.apache.org/jira/browse/KYLIN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15574410#comment-15574410 ]
liyang commented on KYLIN-2088:
-------------------------------

Right now, the MeasureType extension point represents a combination of an aggregation function and a data type. If we follow this design, two MeasureTypes need to be declared: COUNT_DISTINCT(BITMAP) and INTERSECT_COUNT(BITMAP).

The current UDF mechanism also overlaps somewhat with MeasureType; I may combine them into one.

> Support intersect count for calculation of retention or conversion rates
> ------------------------------------------------------------------------
>
>                 Key: KYLIN-2088
>                 URL: https://issues.apache.org/jira/browse/KYLIN-2088
>             Project: Kylin
>          Issue Type: New Feature
>          Components: Query Engine
>            Reporter: Yerui Sun
>            Assignee: Yerui Sun
>         Attachments: KYLIN-2088.patch
>
>
> Retention and conversion rates are very important in data analysis.
> They can be calculated from two datasets that correspond to two different
> values of one dimension. For example, given a count distinct measure such as
> uv (a dataset of uuids) and a dimension such as date, the retention of uv
> between '20161015' and '20161016' is given by the intersection of the two uv
> datasets.
> Fortunately, Kylin already implements such datasets as bitmaps for precise
> count distinct. Only a UDAF is needed to calculate the intersection of two or
> more bitmaps.
> I'll try this and post a patch later.
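
To make the intersect count idea in the issue description concrete, here is a minimal sketch. It is not taken from KYLIN-2088.patch and does not use Kylin's MeasureType or UDAF interfaces. It assumes that uuids have already been dictionary-encoded to non-negative integer ids, it uses java.util.BitSet as a stand-in for Kylin's bitmap, and the names IntersectCountSketch, bitmapOf and intersectCount are hypothetical.

```java
import java.util.Arrays;
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: java.util.BitSet stands in for Kylin's bitmap,
// and every class and method name here is hypothetical.
class IntersectCountSketch {

    // Builds a bitmap from dictionary-encoded user ids (non-negative ints).
    static BitSet bitmapOf(int... userIds) {
        BitSet bitmap = new BitSet();
        for (int id : userIds) {
            bitmap.set(id);
        }
        return bitmap;
    }

    // Counts the ids present in the bitmap of every listed dimension value.
    // A value that has no bitmap makes the intersection empty.
    static int intersectCount(Map<String, BitSet> bitmapByValue, List<String> values) {
        BitSet result = null;
        for (String value : values) {
            BitSet bitmap = bitmapByValue.get(value);
            if (bitmap == null) {
                return 0;
            }
            if (result == null) {
                result = (BitSet) bitmap.clone(); // copy so the inputs stay untouched
            } else {
                result.and(bitmap);               // in-place intersection
            }
        }
        return result == null ? 0 : result.cardinality();
    }

    public static void main(String[] args) {
        // One uv bitmap per value of the date dimension (sample data).
        Map<String, BitSet> uvByDate = new LinkedHashMap<>();
        uvByDate.put("20161015", bitmapOf(1, 2, 3, 4));
        uvByDate.put("20161016", bitmapOf(2, 4, 5));

        int firstDay = uvByDate.get("20161015").cardinality();
        int retained = intersectCount(uvByDate, Arrays.asList("20161015", "20161016"));

        System.out.println("uv on 20161015: " + firstDay);
        System.out.println("retained on 20161016: " + retained);
        System.out.println("retention rate: " + (double) retained / firstDay);
    }
}
```

With the sample data above, 2 of the 4 users seen on '20161015' also appear on '20161016', so the retention rate is 0.5. Passing more than two dimension values intersects all of their bitmaps, which matches the "two or more bitmaps" case mentioned in the description.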