Hi Miao, Kylin use HyperLogLog or Bitmap to persistent the distinct values; You can get some info from this blog: https://kylin.apache.org/blog/2016/08/01/count-distinct-in-kylin/
2017-11-30 9:25 GMT+08:00 崔苗 <[email protected]>: > Hi, > we want to get count(distinct user) group by hour/day/week/month/year,now > we have a problem: > what's the content of count(distinct user) that kylin keeps,the distinct > users set or just a count number? If we want to count (distinct user) by > year,do we need to keep data for a year in hive? > > > -- Best regards, Shaofeng Shi 史少锋
