Hi all,

I just published the streamcount module to npm: https://github.com/jhurliman/node-streamcount

It implements the HyperLogLog and Count-Min sketch data structures. HyperLogLog estimates the number of unique IDs in a stream in real time (useful for unique-visitor analytics), and the Count-Min sketch tracks counts of the most frequently observed IDs (useful for analytics on top viewed pages, videos, products, and so on). Both are probabilistic approaches, so you trade a few percent of error for a drastically reduced, fixed amount of memory. For more background on real-time analytics challenges, see <http://www.americanscientist.org/issues/pub/the-britney-spears-problem>.

The data structures also support serialization, deserialization, and merging. In the web analytics example, you can run this code on each web server and periodically aggregate the statistics elsewhere.

Thanks!
John
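
P.S. To make the "count on each web server, merge elsewhere" idea concrete, here is a rough, self-contained Count-Min sketch in plain JavaScript. It is only an illustration of the technique and is not the streamcount API: the class name `CountMinSketch`, its methods, the default width and depth, and the seeded FNV-1a hashing are all assumptions made up for this example.

```javascript
// Minimal Count-Min sketch. Illustrative only: NOT the streamcount API.
class CountMinSketch {
  constructor(width = 1024, depth = 4) {
    this.width = width; // counters per row
    this.depth = depth; // number of rows (one hash function each)
    this.table = new Uint32Array(width * depth); // fixed memory, flat layout
  }

  // FNV-1a over the key's UTF-16 code units, with the start value perturbed
  // per row so that each row acts like a different hash function.
  _hash(key, row) {
    let h = (0x811c9dc5 ^ Math.imul(row + 1, 0x9e3779b1)) >>> 0;
    for (let i = 0; i < key.length; i++) {
      h ^= key.charCodeAt(i);
      h = Math.imul(h, 0x01000193) >>> 0;
    }
    return h % this.width;
  }

  // Add `count` observations of `key` to one counter in every row.
  increment(key, count = 1) {
    for (let row = 0; row < this.depth; row++) {
      this.table[row * this.width + this._hash(key, row)] += count;
    }
  }

  // The estimate is the minimum counter across rows. Collisions can only
  // inflate a counter, so this never undercounts (ignoring 32-bit overflow).
  estimate(key) {
    let min = Infinity;
    for (let row = 0; row < this.depth; row++) {
      min = Math.min(min, this.table[row * this.width + this._hash(key, row)]);
    }
    return min;
  }

  // Merging two sketches of the same shape is element-wise addition.
  merge(other) {
    if (other.width !== this.width || other.depth !== this.depth) {
      throw new Error('Cannot merge sketches with different dimensions');
    }
    for (let i = 0; i < this.table.length; i++) {
      this.table[i] += other.table[i];
    }
    return this;
  }

  serialize() {
    return JSON.stringify({
      width: this.width,
      depth: this.depth,
      table: Array.from(this.table),
    });
  }

  static deserialize(json) {
    const data = JSON.parse(json);
    const sketch = new CountMinSketch(data.width, data.depth);
    sketch.table.set(data.table);
    return sketch;
  }
}

// Two web servers count page views independently...
const serverA = new CountMinSketch();
const serverB = new CountMinSketch();
for (let i = 0; i < 5; i++) serverA.increment('/home');
for (let i = 0; i < 3; i++) serverB.increment('/home');
serverB.increment('/about');

// ...and an aggregator deserializes one sketch and merges in the other.
const total = CountMinSketch.deserialize(serverA.serialize()).merge(serverB);
console.log(total.estimate('/home')); // at least 8; higher only if hashes collide
```

The memory footprint is fixed by `width * depth` no matter how many distinct IDs are seen, which is the trade-off described above: a wider table reduces the overcount caused by collisions, and more rows reduce the chance that every row collides at once.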
