Sorry for the delayed response. what's the cardinality of the dimension that you want to count distinct values? --> We might be coming across different types of cardinality for the measure. Though unsigned int capacity should cover almost all cases, there might be some cases we miss.
For example, if you want to count distinct users, use the numeric user_id, instead of email address; --> We will see if we can come up with a mapping function and use that for distinct count cast Long to Int may cause precision losing --> i remember seeing something like, good to know its removed and will be introduced later after the fix Regards, Abhilash On Fri, Jan 29, 2016 at 4:51 PM, Sarnath <[email protected]> wrote: > Yes. I was just hinting at practically faster compute using bloom filter. > Will need a way to handle probablistic answers >
