This is very cool, nice job! We've been using HyperLogLog on another project, although that particular part of our stack is in Scala. Awesome to see it implemented in JS, though!
On Friday, May 24, 2013 10:37:54 PM UTC-4, John Hurliman wrote: > > Hi all, > > https://github.com/jhurliman/node-streamcount > > I just published the streamcount module to npm. It implements the > HyperLogLog and Count-Min sketch data structures for real-time counting of > unique IDs (useful for unique visitor analytics) or tracking counts of the > top most observed IDs (useful for top viewed pages/videos/products/etc > analytics). These are probabilistic approaches, so you are trading off a > few percent error margin for a drastically reduced and fixed amount of > memory usage. For more background on real-time analytics challenges see < > http://www.americanscientist.org/issues/pub/the-britney-spears-problem>. > > The data structures also support serialization, deserialization, and > merging. In the web analytics example, you can run this code on each > webserver and periodically aggregate statistics elsewhere. > > Thanks! > John > -- -- Job Board: http://jobs.nodejs.org/ Posting guidelines: https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines You received this message because you are subscribed to the Google Groups "nodejs" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/nodejs?hl=en?hl=en --- You received this message because you are subscribed to the Google Groups "nodejs" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.
