> http://en.wikipedia.org/wiki/Locality-sensitive_hashing > corpus, now, the above may represent a solution, in such a case, the > rebuild may just "skip" messages if they're repeated too often, so > allowing to keep the corpus balanced > also say that the email was recognized as "spam" (or ham, whatever); > now, a second incoming mail hitting the same "LSH bucket" as the first
Also, there's is an opensource library implementing LSH http://lshkit.sourceforge.net/ the library is written in C++ and can be built for windows, linux and BSD; this means that, implementing LSH in ASSP may just mean putting together some Perl "glue" code leveraging the LSH library calls :) ! ------------------------------------------------------------------------------ Try before you buy = See our experts in action! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-dev2 _______________________________________________ Assp-test mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/assp-test
